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Field of Invention 

This invention pertains generally to voice-recognition based or speech- 
recognition-based interactive electronic commerce, and more particularly to systems, 
methods, and methods of doing business for providing automated interactive directory 
assistance information from a business or organization to a consumer in need of goods 
and/or services. The invention pertains even more particularly to systems, methods, and 
methods of doing business for providing automated speech-recognition driven query 
and response with business or event self-promotion features relative to businesses and 
events over ordinary wired or wireless telephone systems, PC systems, Personal Data 
Assistants (PDAs), and other communication and information appliances and devices. 

BACKGROUND 

Locating business establishments, such as for example a restaurant satisfying the 
particular need of a customer, has hereto for generally required access to printed 
directory listings, or more recently access to the World Wide Web using a personal 
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computer. The availability of such references is frequently quite limited at the time the 
consumer desires to avail themselves of particular goods or services. For example, an 
out-of-town visitor driving in their automobile and approaching San Francisco might 
decide to stop and have dinner in a fine Italian restaurant and more particularly might 
like to have dinner in a fine Italian restaurant located in the particular area of the city., 
or to partake of a particular gastronomic delicacy. That visitor would likely not have a 
printed directory in their automobile or mobile access to the Internet to search for a 
restaurant satisfying their current need. Therefore, the visitor would likely either have 
to stop and asked for recommendations or drive around until a restaurant satisfying their 
needs might be located. Perhaps tens of minutes or hours later than desired. This 
approach is clearly inefficient, and the visitor may not have the dining experience 
expected if the restaurant they happen to see while driving turns out to have poor quality 
food, poor service, or both. 

An analogous dilemma arises for other goods and services, whether provided 
to the local residents or to a visitor from out of the area. Frequently information is not 
available to a consumer (perspective purchaser) when he or she needs such information, 
and with the proliferation of a fast mobile lifestyle, there exists and need to provide 
such consumer information with readily available information appliances, such as 
conventional telephones, cellular phones, or other pocket or mobile devices that can 
provide connectivity to a service at minimum cost. 

Frequently such device will have only sparse input/output capabilities. For 
example, a cellular telephone will typically have only a few display lines presenting text 
or symbolic data to a user, but has substantial audio input and audio output capability 
that can be used by the consumer. 

Heretofore, speech-to-text conversion has generally been limited to word 
processing and or computer or control applications as the has required fairly substantial 
processing power and memory within a computer device. For example, speech to text 
conversion products made by Dragon Systems (Lernout & Hauspie, Speech Products 
U.S.A., Inc., 52 Third Avenue, Burlington, MA 01803) generally require an Intel 
Pentium II or Pentium III microprocessor, AMD K-6 or Athlon processor, or the like 
running in excess of 450 MHz and 128 MB of memory. This technology is not 
available in conventional or mobile telephones at this time. Text- to-speech conversion 
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has been known but has not been utilized to provide an interactive interface between 
consumers and consumer information from telephone systems. Such continuous speech 
recognition systems also usually require voice training with the ultimate user to provide 
satisfactory results. 

Furthermore, even for systems which provided some degree of consumer 
information over the telephone, such systems have either not attempted to generate 
business revenues through their operation, or have been unsuccessful in generating 
significant revenue in this manner. In part the lack of revenue success has been due to 
a low level of business participation in such systems, the inability of a business to 
control or modify their message in response to short-term business needs or to sell 
promote their businesses, as well as the lack of a particular incentive for a consumer to 
par take all of the information offered by the service. In fact, there may frequently have 
been a cost associated access to conventional information and referral services by 
consumers, even if only by virtue of the directory assistance by local telephone service 
providers. 

Some conventional systems and methods have been limited to playback of 
recorded audio or audio playback corresponding to the content of web pages; but such 
systems have not integrated Internet or web-based interactions with voice or telephone 
based information provision. They have also frequently provided inferior voice 
interfaces that have annoyed callers rather than having provided a useful information 
experience. 

Local as well as national businesses (really any business, merchant, marketing, 
or other organization) and their customers have made significant sacrifices in order to 
find each other and conduct a transaction for goods or services. These sacrifices result 
at least in part from the nature of the information resources available to match 
businesses and their customers, primary among these information sources are the 
printed Yellow Pages, on-line Yellow Pages, conventional 411 directory assistance, 
print media coupons, and on-line coupons. 

Businesses (particularly local businesses) make sacrifices when relying on 
printed Yellow Pages because they are expensive, difficult to sort prices and options, 
cannot be changed once in print, and their effectiveness cannot readily be determined 
without additional time consuming and expensive surveys. From customers, or 
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potential customers, perspective sacrifices are made because they are too heavy to carry, 
present too many choices so that it is difficult for a customer to figure out which one 
to call as they usually lack the information that may effectively provide the decision 
criteria, the information is usually stale (12-16 months old is typical), and many 
customers may forget to take advantage of coupon offers even when available. Printed 
Yellow Pages and related types of large bulky paper directories clearly have other 
problems and limitations. 

Even on-line or Internet Yellow Pages or directories have limitations that 
present sacrifices for both merchants and their potential customers. From the 
merchant's perspective, for example, there are many different on-line directory 
providers and a business must make a decision as to which one or ones to associate 
with, a personal computer is required to access and update such directories and may not 
be available when and where needed, updating may require payment to an ISP or 
programmer, the directory reaches only those with an Internet connection, and 
additional fees must normally be paid to place, ad banners. From a customer's 
perspective, the information provided is frequently inaccurate and limited, an interne 
connection is required to access the information, and either a personal computer or 
WAP enabled device is required to access. 

Local directory assistance and more recently national directory assistance 
through either a 41 1 type information service is also limited. From a local merchant's 
perspective, even when a potential customer receives a correct number the connection 
may not be made because either the customer must redial to get the number or pay an 
additional fee to be connected. From a customer's perspective, there is relatively easy 
access but at a cost of between about $0.50 and $1.00 per call. The directory, 
particularly when using the wireless directory assistance, is notorious for providing 
wrong numbers and there is no ability to get the correct number without re-dialing and 
paying an additional fee. Additional charges are also typically billed for requesting 
address information if available. 

Reliance on print media coupons also entail sacrifices. For example, they are 
static and cannot be changed once in print. It takes a relatively long period of time 
between developing the coupon promotion and getting feedback as to its success or 
failure. Redemption dates cannot be changed, and one merchants coupons frequently 
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get lost in the noise of other unrelated or competing coupons. Customers find it 
difficult to identify relevant coupons, find it a hassle and hardly worth the time or effort 
to cut them out or save them, and have a difficult time keeping tract of expiration dates. 
In some social settings, they present a questionable social image, and cannot always be 
carried with the person so they are not available when an occasion to use them arises. 

On-line coupons are somewhat of an improvement however they still present 
issues. They are expensive to place at high traffic portals for local merchants, and may 
still be lost in the noise of other promotions. There is low traffic at coupon only sites 
and their reach is limited due to the need for an Internet connection. The use of 
coupons may also be favored by groups that may not have ready access to the Internet. 
Customers also find them hard to locate and a hassle to pint, copy, or cut out. Access 
by many groups of persons, or by persons at the time they consider making a purchase 
may be limited. 

In addition to the limitations and sacrifices made by merchants and customers 
relative to establishing a contact for the provision of goods or services, there currently 
exist additional problems and limitations relative to wireless data communication of 
advertising information. For example, infrastructure and methods have not been 
established or are in their infancy and must be developed before true wireless 
advertising can become widely available and accepted. Data ready telephone handsets 
exist but are in the minority and typically represent higher-end and more expensive 
models. Mobile phones in the hands of the majority of consumers do not provide for 
wireless interaction with merchants or for the receipt of audio advertising, marketing, 
or promotional information. Nor are they typically used for voice recognition 
applications. Merchants have been hesitant to participate in wireless data transactions 
and the lack of consumer interest in marketing messages generally have contributed to 
lack of development and progress in this area. 

Techniques for building speech recognition applications are some what in their 
early development stage, however, some information is provided in the reference How 
to Build a Speech Recognition Application - A Style Guide for Telephony Dialogues, 
by Bruce Balentine et al., ISBN 0-9671278-1-5, published in 1999 by Enterprise 
Integration Group, Inc., 2410 San Ramon Valley Blvd., Suite 225, San Ramone, 
California 94583; which reference text is hereby incorporated by reference. 
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Therefore there remains a need for a system, method, and business operating 
model and method that overcome these and other limitations. More particularly, there 
remains a need for a method of doing business, an information, directory assistance 
service and referral service providing easy access by businesses and consumers, as well 
5 as providing business self promotion and consumer feedback features that encourage 

use, generate revenues, and provide incentives for use by both businesses and 
consumers. Such services should advantageously provide more information than 
traditional 411 type directory assistance in terms of greater amounts of information, 
greater accuracy and currency of information, award programs, and other features that 
10 encourage businesses to participate and consumers to call and utilize such systems. 

SUMMARY 

The invention provides a system, method, and business model for an 
information system and service having business self-promotion features, including 
03 15 \ voice coupons, discounts, sales promotions or other special deals, ratings, and different 
j\ \ categories of sponsorship and visibility to the calling public. In one aspect, the 

inventive business model is directed to a business in which consumers call into a 
^ervice using an ordinary telephone, PC, PDA, or other information appliance, and 
\Hiake requests in plain speech for information and positive referrals on goods and/or 
20 sjirvices, and the service provides responses to the request in plain speech in real-time 

oijgr the same telephone, PC, PDA, or other information appliance. The business model 
may further include providing a facility for a business to communicate a self-promotion 
of the business to the requestor, as well as providing an audio promotional coupon (or 
other promotional item) to a requestor when the requestor completes a call to a business 
25 using the service. 

In another aspect the inventive system extends these features to the Internet and 
inter-operates with the Internet in a web page based embodiment to provide an 
additional portal and greater interactive capability. 

In another aspect, the invention provides an operating model for a telephone- 
30 based audio or speech recognition and text to voice interfaced goods and services 

information, enhanced 411 type directory assistance and referral service having 
merchant self-promotion features, comprising: an information database provider storing 
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merchant information; a merchant interface for inputting merchant information into the 
database and for retrieving and editing the information; and a consumer interface for 
inputting voice commands and data and for receiving merchant information and 
processed information from the database in response to the input voice commands and 
data. The operating method may provide that the consumer interface comprises a 
telephone handset, and/or that the consumer also inputs non-voice commands and data 
from a keypad on the telephone handset. The operating model may also provide that 
the telephone handset comprises a mobile telephone. 

In another aspect, the invention provides a system comprising: a speech-to-text 
conversion engine converting speech-based input commands and data received from an 
external device over a communication link into text-based commands and data; a data 
base storing a plurality of data items; a database search engine searching the database 
for a particular data item in response to the text-based command and data; a text-to- 
speech conversion engine generating a speech-based representation of the particular 
data item identified in the database search; and a speech server for communicating the 
speech-based representation of the particular data item to the external device. 

In another aspect the invention provides audio coupons that operate as 
incentives for consumers to use the inventive system. 

In still another aspect, the invention provides system and methods for 
submitting and retrieving ratings for goods and/or services. 

In yet another aspect, the inventive system and method assists in providing 
directory driven wireless commerce. 

In yet another aspect, the inventive system and method provide a promotion and 
advertising channel that has geographical and sociological reach and the speed needed 
in today's dynamic financial and commercial markets. 

In still another aspect, the inventive system and method provide a voice- 
interactive dynamic marketplace where individuals (particularly locals) call to save and 
businesses (particularly local merchants) call to publish sales promotions in real-time 
or near real-time. 

In still another aspect, the inventive system and method communicate 
information on an as requested basis that goes beyond the telephone number. 

In yet another aspect, the inventive system and method provide instant savings 
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with voice coupons published by local merchants. 

In still another aspect, the inventive system and method provide an advertising 
free initial experience where voice or audio coupons are only heard attached to 
businesses that the caller has requested or searched for. 
5 In even another aspect, the inventive system and method provide for hands-free 

navigation with voice commands on any telephone or device supporting telephony. 

In yet another aspect, the invention provides a low cost 411 directory assistance 
with added informational features that is easy to access. 

In yet another aspect, the invention provides benefits to merchants including but 
10 not limited to targeted reach, instant promotion, instant or near- term feedback, and an 

optional free Internet web presence. 

In still another aspect, the invention provides benefits to common carriers and 
telephone companies who save conventional 411 costs, process higher call volumes, 
and attract new customers. 
1 5 In yet another aspect, the invention provides a business model in which voice 

coupons are sold for distribution, a monthly fee is charged for subscription to the basic 
service, and additional charges are levied and collected for business category 
sponsorship. 

In still another aspect, the invention provides a business model in which the 
20 providing organization partners with a print yellow page or other business directory 

publisher and/or with direct marketing organizations to subscribe merchants, 
businesses, individual professionals, or other organizations. 

In a further aspect, the invention provides a business model in which the 
providing organization partners with a yellow page, wireless providers, telephone 
25 companies, and conventional 411 call centers to generate call traffic and thereby 

increase revenue. 

In yet another aspect, the invention provides a business model in which 
conventional 411 directory assistance providers are replaced by the inventive system to 
save carriers 411 costs and to offer a new shared revenue channel and business model. 
30 In another aspect, the invention provides a business model in which new 

customers are solicited and provided with value added services. 

In still another aspect, the invention provides a business model in which direct 
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marketers are provided with a new coupon or promotion distribution channel. 

In still another aspect, the invention provides a business model in which new 
business is brought to direct marketing organizations through the voice channel of the 
inventive system. 

In still another aspect, the invention provides a business model in which direct 
marketing organizations are provided with rapid marketing feedback for their clients 
and customers. 

In still another aspect, the invention provides a business model in which print 
yellow page and print business directory publishers increase their existing revenues by 
increasing the yellow page or directory ad size sold by virtue of ad space required by 
a trademark and/or "voice coupon" icon or logo. 

In still another aspect, the invention provides a business model in which print 
yellow page or other print directory publishers are provided with increased usage of the 
yellow pages or directory to show which vendors provide immediate savings. 

In still another aspect, the invention provides a business model in which yellow 
page or other print directory publishers are provided with an enhancement in their 
market positioning by virtue of their providing an more complete and compelling 
offering. 

In still another aspect, the invention provides the closest locations for a 
particular requested category where the location of the caller is known from a caller 
location input, cellular signal triangulation, GPS position determination, or other 
position or proximity location means. 

In still another aspect, the invention provides means for a merchant to interact 
with the system using either voice or web interface and select templates for the type of 
social, economic, political, age, gender, profession, or other image the merchant wants 
to portray and the type of promotion message the merchant wants to publish with a 
small number of mouse clicks, key strokes, or voice commands and prompts. 

In still another aspect, the invention provides system and method for 
establishing user groups ("My" ... group) and communities based on lifestyles, usage 
patterns, interests and interest levels so that a registered user can subscribe to a group 
of multiple groups where merchant listings and other relevant information is given a 
priority. Such group or community may include a bicycling enthusiast who can 
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subscribe to the bicycling group and when he calls, he says "My bike" and will then be 
offered services that has something to do with biking. 

In still another aspect, the invention provides means for obtaining ratings, in 
which once a caller gets connected to the business through the talk41 1 service system, 
after a period of time measured in hours the service calls back the caller to ask for 
ratings or to collect feedback to improve service, where the caller has either registered 
to permit this inquiry or does not have caller ID blocked. 

In still another aspect, the invention provides system and method for merchants 
to post customer testimonials so that future callers can hear these messages as a 
reference that may help make a choice of which merchant they want to be connected 
with, and optionally, as the service gets used callers can leave testimonial messages 
which the business can choose to post for other users access. 

In still another aspect, the invention provides system and method for merchants 
to post key words on the voice system or internet site which can be used as a search 
term by the caller and that will be used as a navigation pointer to the posting merchant. 
In a further aspect, this key word or phrase based search may be used in connection with 
a new product or service (new movie, new CD, new restaurant, or any other product or 
service) so that the caller may speak (or otherwise input) this key word and be matched 
with one or more merchants offering it. In an even further aspect, priority use of such 
key word may be auctioned to merchants for priority playback to callers. 

In still another aspect, the invention provides system and method allowing 
merchants to post their promotional message or company information in multiple ways 
including to record their own, select from a voice talent who would record the text the 
merchant put in the system using voice, a personal computer, or in any other way; or 
just type in an let a text-to-speech processor convert it. 

In still another aspect, the invention provides a coupon aggregation and 
translation engine and service allowing aggregation of different formatted coupons 
from online sites and reformat such coupons to a standard Dialsurf or other format that 
allows them to be played over the telephone phone. 

In still another aspect, the invention provides user call back to remind the user 
to rate a recently used local merchant service. 

In still another aspect, the invention provides means for an over the telephone 
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offer for a user to become an instant member to a community or coupon distribution list 
using email or other communication means once the user asks for a specific category. 
In a further aspect, if the user is registered and set up "my4H" features, the user gets 
coupons for this category delivered to his "my41 1" voice box. 

In still another aspect, the invention provides merchants the ability to identify 
specials in their "additional information" or "coupon message" that is used to find that 
merchant when those words are used over the phone in the key word search mode 

In still another aspect, the invention provides system and method for publishing 
secret words in local newspapers, Internet chat rooms and other community oriented 
online and offline boards, where they can be used on the phone or on our web site as a 
password to enter a sweepstake or win a prize, and to thereby provide a beneficial 
marketing tactic to increase sales of print papers and increase traffic to online portals 

In still another aspect, the invention provides voice coupon targeting based on 
area code and prefix, city, geocoded location, GPS location, zip code, cross streets, 
vicinity of a milestone, major tourist areas, major landmarks, airports, night clubs, 
entertainment centers, shopping malls, restaurants, and the like. 

In still another aspect, the invention provides means and business model for 
facilitating spread of secret words through the word of mouth initiating from someone 
in the company to provide access to privileged information, prizes, etc. to enhance the 
repeat user experience 

In still another aspect, the invention provides the use of live agent interaction 
on a random call basis to provide a surprise element to enhance the user experience, 
where live agents can be celebrities. 

In still another aspect, the invention provides user choice or automatic choice 
of voice of the Talk411 attendant based on gender, age, interest, and other selection 
criteria. 

In still another aspect, the invention provides user choice to select synthesized 
voice of a celebrity as the automated attendant. 

In still another aspect, the invention provides system and method for insertion 
of trivia questions where correct answer wins a prize from a local merchant. 

In still another aspect, the invention provides a business model in which 411 
directory assistance call centers are provided with an added revenue stream to improve 
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slim profit margins, to increase call volume by providing a desirable information 
service, to keep their current carrier customers with value added services, to decrease 
their operating costs and overhead by reducing the number of human employees, and 
by overcoming severe local competition on wireless carriers. 

The invention also provides further apparatus, system, method, operating model 
and business method, computer program and software, and computer software program 
products that interoperate with the inventive systems and methods. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Additional advantages and features of the invention will become readily 
apparent upon reading the following detailed description and appended claims when 
taken in conjunction with reference to the following drawings, in which: 

FIG. 1 is a diagrammatic illustration showing an exemplary embodiment of the 
inventive system. 

FIG. 2 is a diagrammatic illustration showing an exemplary embodiment of 
speech server functionality. 

FIG. 3 is a diagrammatic illustration showing an exemplary embodiment of a 
new business user (merchant) interaction with the inventive system. 

FIG. 4 is a diagrammatic illustration showing an exemplary embodiment of an 
existing registered business user interaction with the inventive system. 

FIG. 5 is a diagrammatic illustration showing an embodiment of a general 
consumer user interaction with the inventive system. 

FIG. 6 is a diagrammatic illustration showing an exemplary implementation of 
the inventive directory service on the Web. 

FIG. 7 is a diagrammatic illustration of a flow-chart for a method of interaction 
between the inventive system and a caller, including call flow procedures and 
procedures for articulating messages to the caller and for receiving inputs from the 
caller. 

FIG. 8 is a diagrammatic illustration showing an embodiment of an alternative 
call handing procedure by the inventive system and service call center. 

FIG. 9 is a diagrammatic illustration showing an exemplary coupons web page. 
FIG. 10 is a diagrammatic illustration showing an exemplary get business web 
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page. 

FIG. 1 1 is a diagrammatic illustration showing an exemplary member menu web 

page. 

FIG. 12 is a diagrammatic illustration showing an exemplary member 
registration web page. 

FIG. 13 is a diagrammatic illustration showing the relationships of several web 
pages to each other. 

DETAILED DESCRIPTION OF EMBODIMENTS OF THE INVENTION 

It will be appreciated that conventional methods and approaches to matching a 
consumer with a merchant, business, or other individual or organization are 
unsatisfactory at best, and severely lacking, in many situations. The limitations and 
sacrifices are described in the Background section of this specification. Even with 
approaches that are more modern than conventional printed yellow pages or telephone 
number providing only 411 -type directory assistance, the actual improvement measured 
in terms of merchant, consumer, or industry adoption has been minimal. A combination 
of technological, philosophical, and psychological needs and forces has conspired 
against the success of such attempted improvements. 

Attention is therefore first directed to an overview of the inventive system, 
method, computer programs and procedures, and Talk411 service and then to a 
discussion of certain needs and goals that are desirably satisfied for such an 
information system to meet the needs of merchants, consumers, and service providers 
as well. These goals do not rise to the level of requirements, as a system that fails to 
meet all of the goals may nevertheless provide advantageous utility, but arguably the 
more goals that are satisfied, the more satisfying the implementation may be. Many of 
the features described in the following paragraphs are also clearly optional and although 
they provide great utility and many benefits, would not be required in a basic system or 
method. 

Talk411 is a one-stop fresh information source that offers instant promotion 
capabilities to local businesses using the telephone or the web. The information service 
is free for consumers. It is a promotion service for local businesses similar to the 
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Yellow Pages, except it can be accessed with voice interaction on the phone or with 
display devices equipped with an Internet browser. 

The information database (content) may be acquired in any number of ways, 
including from one or more of the following ways: (i) Purchased from local telephone 
operators, such as Pacific Bell or other operator, (ii) Input from local businesses who 
subscribe to the service, or (iii) Content partners who provide local information on the 
web. 

The local businesses can subscribe, access and maintain the information they 
want to communicate to the consumers in at least the following two ways: (i) Call the 
Talk41 1 service and voice navigate/input, and (ii) Browse the Talk41 1 .com and input 
by typing the information, uploading business information such as logo, etc. The 
consumers can access the information in several ways as well, including: (i) Call the 
Talk411 service and voice navigate and speak naturally, and (ii) Browse the 
Talk41 l.com web site. 

Selected features that a business may user are now described. These are 
examples to illustrate some interesting aspects of the invention and are not limitations 
or requirements of any particular embodiment. 

Businesses can, for example: (i) Subscribe, register, and input their business 
information and promotional messages using the telephone or the web site, (ii) Become 
a member to the standard service, where they pay a low monthly fee in exchange for 
extended business information prepared specifically for the category they belong. This 
information is provided upon information request about that specific business, (iii) 
Become a Category Sponsor, where a message is played/displayed when a caller request 
businesses in a category, prior to playing the choices, (iv) Purchase Category positions 
for an additional monthly fee, where they purchase the right to be announced/displayed 
at a priority level when the callers/browsers request businesses in a specific category. 
Category Level 2 position for the month will ensure that the promotional message is 
announced/displayed in second place when the caller requests businesses in that 
category. If no Category position is purchased, information will be pulled from the 
database on a random basis, (v) Become a General Sponsor for a larger fee, where their 
message gets heard as the first message as the call gets answered, (vi) Purchase Audio 
Coupons, where the callers get a discount on products/services if they call Talk411 
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service and request information about the specific businesses, upon hearing a prompt 
from Talk41 1 or in any of the promotional messages mentioned above. Audio Coupons 
can be purchased with a credit card on the Talk41 l.com web site, or on the Talk41 1 
telephone service, (vi) Get reports on access/call patterns in their category, (vii) Get 
consumer ratings of their businesses upon participation in the "BayHits" discount 
program. This program allows registered users to get discounts from participating 
businesses in return for feedback on products/services delivered. 

Consumers can, for example: (i) Obtain free of service charges, unlike the 41 1 
directory services which cost about $0.46 to about $0.95 (local phone charges still 
apply) get local business information, (ii) Have hassle free navigation with voice on the 
telephone, focused web site with local business information on the Talk41 Lcom web 
site, (iii) Obtain more information than just the number and address for the local 
businesses. More relevant information, since the business chooses the type of info they 
want to communicate, (iv) Locate businesses of interest by speaking naturally and 
specifying a category, (v) Locate a business by specifying zip code, name, street, phone 
number or other relevant information, (vi) Get discounts with Audio Coupons from 
promoting businesses, (vii) Become a Reviewer by registering to the "BayHits" service 
and get discounts in exchange for rating businesses on the Talk41 1 service. 

From a certain perspective, for a system to be adopted and used by businesses 
and consumers, the system must not only be technically viable but must also satisfy the 
needs of the users. In this vain, the inventive system and method are designed to satisfy 
certain overall philosophical and subjective requirements as well as other more specific 
telephone service, Internet response, merchandising and marketing needs. In the 
paragraphs immediately below, some of the concepts and mechanisms that will 
facilitate providing a high quality of service and acceptance and adoption by the 
business, organization, and consumer communities are described. These should be 
interpreted as goals and guidelines rather than requirements of the inventive system or 
method, as they need not all be adopted or implemented to sustain a successful 
implementation. Rather they contribute to implementing a commercial system and 
method, and as such provide options for a commercially successful system, method, and 
business model. 

A caller or user (whether the business or other organization providing talk41 1 
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information of a customer making an inquiry) needs to be able to interact with the 
system with the utmost ease and simplicity. Speed and simplicity should desirably only 
be compromised for accuracy. 

The effectiveness of the system and service may be assessed and measured with 
the ability of the caller/user getting information with no more than one phone call in 
most cases. A key benefit for the callers are convenience and savings from local 
merchants/businesses, hence improving the quality of their lives. Key benefits for the 
merchant/business/organization are the speed of deploying promotions (or other 
information) at a low cost, ability to get quick feedback from the marketplace and 
change their promotion or message dynamically, hence generating quality self- targeted 
leads. The first impression of the users should desirably be to fun, fast, accurate, and 
helpful information retrieval. 

With these top-level needs in mind, attention is now directed to telephone 
service preferences. Prompts and menu designs, especially for audio prompts, menus, 
or other indicators. Sound may desirably be used as an indicator where possible to 
shorten the amount of words, and a minimum (or reduced) number of questions should 
be asked to get the answer. Where a response may not be clear, it may typically be 
beneficial to ask the user to confirm the input, in order to avoid providing incorrect or 
inappropriate information, and if still not clear, connect to live operator. As consumers 
are already familiar with U.S. nationwide basic 41 1 directory assistance, there may be 
desirability in emulating the basic 411 experience with additional courtesy and 
information. In some embodiments, limit the registration to phone number and 
password only, and direct potential member to web site to complete the registration, 
perhaps offering some incentives for such web based registration. Simple navigation 
using a limited number of intuitive key words to access information with a special lead 
phrase are desirably used. As a positive courtesy, a greeting at entry and hang-up may 
be provided. With respect to response time, it is desirable that the wait not exceed the 
typical time for operator assisted directory assistance, and a wait time of no longer than 
about 10 seconds (and preferably less than about 5 seconds) for information. 

When a caller makes an inquiry, the phone number is retrieved based on the 
business (or organization) name and location, and the caller is then asked whether to 
connect to that phone number. The location can be any designation, for example any or 



A-69342/RM 



- 17- 

all of an address, a city, a zip code, or major cross streets or other recognizable location. 
If a location is not specified, then various rules or policies may be used to select, for 
example, in one embodiment, three possibilities are deliver by city or street names. In 
the event that there is more than one of the business in the area searched, for example 
there may be three Radio Shack locations in the search area, then the caller may be 
presented with three alternatives and locations selected according to any scheme, and 
asked for a decision with which to connect. In either case, the selection may be made 
either by the number or by the location. 

Embodiments of the inventive system and method provide optional features 
including "more" information, and a "bayhit" feature. The "more" feature is provided 
for business or organizations that prefer to provide additional information than standard 
phone number, address, type of business information. A "bayhit" feature relates to a 
rating popularity or recommendation feature as described elsewhere herein. Businesses 
or organization, or specific locations of a business or organization, having more 
information may be identified using a distinctive sound, tone, word, or other audible 
indicator to identify them as having "more" information that may be accessed. Use of 
a distinctive sound reduces the time required to communicate the information and also 
reduces the computational burden of delivering additional spoken words. The bayhit 
feature (or equivalent by another name) may be indicated by a different sound, tone, or 
word than the "more" feature, and businesses or organization that are both more and 
bayhit may be identified with some third set of audio indicators or hybrid or composite 
sounds. 

Where an audio or voice coupon is involved, the audio coupon may 
advantageously be played or enunciated at the time the phone number is delivered. 
Alternatively, the caller may be given an option as to delivery of the audio coupon. 

When a caller request or inquiry is processed by the system, the system 
advantageously retrieves a predetermined number of the closest matches to the 
requested category and location. In North American culture, three seems to be the 
appropriate number, however, providing one to 4 of the closest matches might represent 
the normal range. In some contexts, providing five or more or the closest matches 
might be appropriate with due consideration given to the brevity of delivering so many 
alternatives. 
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Advantageously, the names of the matches are delivered along with the 
availability of any coupons (when the coupon feature is implemented) and optionally 
the content of the coupon. Relevancy may be based on a user location input, this 
location input may be expected to optionally be provided automatically by caller 
identification based location determining mechanisms, cell phone base station 
triangulation mechanisms, or phone based Global Positioning System (GPS) location 
determining mechanisms. When such caller location information is available by 
whatever method, the search and relevancy may be a location sensitive search. 

Upon delivering the predetermined number of matching (or most nearly 
matching businesses or organizations), the caller is requested to chose from these or to 
request delivery of additional matches. Either a predetermined number of additional 
matches may be delivered or the system may request the caller specify how many 
additional matches, with an optional limit imposed by the system to reduce the burden 
on system resources and the total connect time. In one embodiment, the number of 
matches delivered at each stage is three matches, and the total number of additional 
requests that can be made is four. More generally, the number of matches delivered for 
each request and the number of requests for additional matches for the same call may 
be limited to suit local markets and customs. 

The system desirably recognize the caller's choice by business name or by either 
a vocalized queue number or phone keypad press corresponding to the number. IF one 
of the matches has more than one location in the relevant search area the caller may be 
presented with the alternative locations or address information when that business is 
selected. In this regard, the system may filter a business that has multiple locations so 
that the predetermined number of matches are to different businesses rather than to 
multiple locations of the same business entity. If the caller is interested in that business 
entity, then the caller is provided with information as to the multiple locations and 
asked to select or choose the desired location. 

The phone number for the selected business or organization (and location) is 
then delivered and the caller is optionally but advantageously asked whether the caller 
wishes to be connected. The ability to be connected is particularly advantageous for a 
caller using a mobile phone where materials to write the phone number may not be 
readily available. Automatic connection is advantageously free to the caller and may 
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be free to the business or may incur some additional fees. Various alternative credit, 
bonus, fee and non-fee based mechanisms may be implemented. 

In some embodiments, it may be desirable for the caller to provide a phone 
number to a particular business name and location, and to have the system search and 
verify a business name by saying the phone number, and getting the business name back 
with the coupon if available. This feature may be referred to as a reverse search by 
phone number with optional coupon retrieval. This feature permits the caller to 
effectively make an inquiry as to the availability of a coupon offer at a particular 
business establishment, to make comparisons between business establishments, and to 
make a decision accordingly. 

In a further embodiment, a coupon request may be made by the caller asking for 
a category alone, a category and location, or any other item or combination of items on 
which the database may be queried. Based on the results of such a category or other 
criteria search, the coupon may be provided. Note that although this and the above 
discussion focus on the voice based system, these coupon mechanisms (particularly the 
ability to perform directed searches on the basis of business identity and category) may 
also be implemented for the Internet based service. 

Desirably, audio coupons delivered in this manner should have a maximum 
permitted length. For example, in one embodiment coupons are limited to 10 seconds 
and in another embodiment to 5 seconds or less. Caller consumers are not generally 
interested in receiving additional advertising, rather they want to know the essence of 
the offer, what discount, free item, cost saving, or other enticement is being offered, and 
no more. They are then free to make an inquiry once connected to the business, 
merchant, or other organization as to further information. Again, there is desirability 
of satisfying the needs of the caller consumer, the desires of the business, and the 
practicalities of system implementation where the number of lines, processors, servers, 
and other software and hardware infrastructure and impacted by call volume and 
duration. 

Once the caller selects the matching business or organization with which to 
connect, the system dials and connects the call, checks for an answering machine tone, 
delivers a message that informs the business that this is a Talk4 1 1 call and identifies the 
coupon promised to the caller, and transfers connection with the business to the caller. 
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Optionally, the caller may be connected to the business during this hand over or transfer 
process, but normally this is disadvantageous as it appears to add undue complexity to 
a process that the caller should view as transparent and effortless. Advantageously, in 
one embodiment of the inventive system and method, during some predetermined time 
interval following the connection to the selected business, the caller can take action to 
terminate the call with the business and reconnect with the talk411 system. This 
presupposes that a connection be retained with the talk411 system, or that a 
re-connection mechanism be provided. These mechanisms are readily available in the 
telephony art. 

This predetermined time period is anticipated to be in the 1 5 second to 2 minute 
time frame, and nominally about 30 seconds so that if the caller determines the selected 
business does not satisfy the need of the caller, he or she can try again without undue 
difficulty or expense. In one embodiment, the caller action involves pressing a key or 
combination of keys so that commands to the system may be clearly distinguished from 
a verbal discussion with the proprietor or employee of the business. A tone signaling 
scheme will likely be more readily interpreted than a voice command, particularly if 
automated switching and routing is involved. 

Embodiments of the system provide the ability for a caller to rate or otherwise 
provide positive or negative feedback regarding a talk411 member business or 
organization. Desirably, the rating of any particular member should be limited in some 
way to avoid the possibility of a single caller, unduly influencing a member rating in 
either an absolute or statistical sense. The limit may be rules that provide for a 
maximum total number of rating inputs for a business, a limited number per month or 
per year, or some other rating restriction rules. The goal is to provide other callers with 
a rating that fairly represents the opinion of all callers that have had an experience with 
the business, and not to allow either an individual that has had a bad experience to rate 
the business negatively multiple times. These limitations would also prevent an 
individual from inflating the rating of a member business by calling multiple times with 
a positive rating. Control mechanisms would also be desirable to keep the proprietor 
of the member business from self-rating in an unwarranted manner. These controls may 
involve technological mechanisms and/or legal contractual agreements with the member 
businesses. 
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In one embodiment, a caller desiring to rate a member business has to sign on 
or register in some manner so as to identify themselves. For example, the caller may 
sign on to the system by saying a phone number and a password. Individuals desiring 
to participate in rating member businesses may be provided with certain benefits, points 
in a points or award program, discounts, or other promotional and/or marketing 
enticements for their participation. In one embodiment, the system implements a 
Rate2save points program wherein a registered caller earns points that may be redeemed 
for goods or services in exchange for rating businesses of organizations. In another 
embodiment, registration is not required, however, the rating made is associated with 
the caller's telephone number if the system can determine the callers caller ID so that 
multiple ratings of the same business from a single telephone number cannot be made, 
or cannot be made within predetermined time intervals. In the desire to maintain 
privacy, a message may also be played indicating that the caller's telephone number will 
be examined and compared for the purpose of ratings control and requesting permission 
to continue. 

A caller will say the name of the business, location or other information such 
as phone number to quickly identify the business and rate on some predetermined scale, 
such as for example, a scale of 1 - 1 0, or any other numerical or non-numerical scale that 
permits the caller to use other adjectives such as for example poor, good, great, 
exceptional, too expensive, no parking, or any rating information that would be of use 
to a later customer. Ratings in multiple areas may also be permitted. For example, a 
restaurant may be rated according to food quality, food portion, service, price, wine 
selection, parking availability, or other factors of interest to potential customers. As 
system evolves and direct or indirect feedback is obtained from businesses and 
consumers, particular rating categories and scales found to be useful will be identified 
for the various categories of business and the ratings categories may be updated and 
modified accordingly. 

Once the caller rates the business, points or other benefits that the caller rater 
may be entitled to will be reported and the system will hang up. These ratings will then 
be compiled and made available to subscribed member merchants and businesses on the 
web as well as the telephone. 

For embodiments of the invention that provide a hit or popular site (restaurants, 
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night clubs, coffee shops, bookstores, etc), best sites, highly recommended sites, or the 
like listing or directory such hit sights are referred to as Bayhits after the San Francisco 
Bay / Silicon Valley region where the invention was developed. Other regional areas 
may adopt other names for such attractions, such as Twin-City Hits (Minneapolis -St. 
Paul Area), LAHits (Los Angeles), or any other title whether containing "hits" or not. 
Callers may make requests by category such that such BayHits (or equivalent) may also 
be found by category and delivered in the manner already described. 

While many of the advantageous features of the inventive system and method 
are provided using voice recognition and text to speech processing over existing 
telephony infrastructure, other administrative facets may advantageously be taken care 
of using an Internet Web site or live agent. Live agents may also be occasionally used 
to set up and register. These administrative features may only be implemented where 
the clarity and preciseness of textual input and/or the volume of input increase the 
efficiency and accuracy of the process. These administrative procedures may also 
involve hybrid voice and Internet components. For example, an initial stage of 
registration or sign-up may be made over the telephone with minimal information 
provided over the telephone. The callers are then directed to a web site where 
registration may be completed. This form of registration may apply to either or both of 
businesses and consumers, for example consumers enrolled in the Rate2save points 
program. The advantage of such on-line or web based registration being that such 
registration does not tie up a phone line that might better be made available for 
responding to caller requests, and the accuracy associated with typed or menu driven 
registration may be somewhat greater given the current state of voice recognition 
technology. It is important that data entered into the database be accurate, and such 
accuracy is promoted by test input. This does not preclude the use of voiced or spoken 
input and it is likely that as voice recognition technology improves that the differences 
between text or character driven digital input and recognized and converted speech to 
digital inputs will lessen. 

Merchants, businesses, and other organizations may register by providing their 
phone number and a password for a trial period. This may include a simple registration 
with minimal information obtained over the telephone and direct callers to the web site 
to complete the registration in order to take full advantage of all merchant and business 
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services and options. In instances where further registration beyond the trial period are 
required, especially if the additional registration is required for the merchant or business 
to avail themselves of additional capabilities, such registration may be completed over 
the telephone, over the Internet, or through the use of paper document of electronic 
contracts or agreements. 

After or as a component of registration, the basic information for the business, 
such as formal and common name of the business, business address, business phone 
number, are verified so that the business may subscribe to the inventive Talk411 
service. If the business or organization is not in the database contact is made though a 
customer service representative or other means. Credit card or other bank or payment 
information is provided to initiate of continue subscription to the service. The business 
or organization is then provided with an opportunity to record a message to be heard by 
a caller when callers ask for more information about the business. In one embodiment, 
the duration of this business introductory message is limited to a predetermined period 
of time, usually between 10 seconds and 1 minute, and more typically not to exceed 
20-30 seconds. An opportunity is given to review and re-record the message, to 
confirm the message, and ultimately to publish it. In one embodiment of the invention, 
provision is made to allow a celebrity or other talent to record the message. This 
optional service and feature is described in greater detail elsewhere in this specification. 

Various levels of membership are supported and different optional features may 
be made available at the various membership levels. Membership and subscription fees 
may in some instances be tied to membership level or to options selected. 

Whether provided as a membership level or an option within one or more 
membership levels, a business or organization once subscribed (or registered in a trial 
period), may purchase sponsorship for the category they belong to by specifying the 
category, number of months category sponsorship is desired, and level of sponsorship. 
In one embodiment of this category sponsorship feature, the sponsor business is put 
in an ordered list or queue (or random pool) along with others who have paid for the 
same level of sponsorship. When its turn comes (whether through a sequential ordering 
or via a random or statistically determined selection), the name and phone number 
(along with a attached coupon, if any) for that category sponsor business or organization 
will be heard if a caller asks for businesses by category. 
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In one embodiment, there are three levels of sponsorship. Level 1 consists of 
those sponsors who paid the highest to have a greater frequency of presentation to 
callers (a higher hit rate), level 2 with lower frequency or lower hit rate and cost, and 
level 3 with the lowest frequency and hit rate and cost. In another embodiment, a 
category sponsor may pay for a fixed number of placements per week or per month. 
Other placement parameters may pertain to geography, time of day, length of message, 
or other parameters that may tend to drive more business volume to the category 
sponsor. While category sponsorship is one type of sponsorship that may be 
implemented, sponsorship opportunities may exist for other aspects of the Talk411 
service. 

Voice or audio coupons may be created and purchased once a business or 
organization have subscribed or registered. Typically, a merchant will telephone and 
specify the number of coupons he or she wants to purchase and provide credit card 
information or other billing information and confirm the purchase. Usually, coupons 
will be bought in quantities in excess of 1000, though there are no limits on the quantity 
that may or must be purchased as any single transaction. The message that will be 
played in the form of a coupon is recorded (by the merchant or an agent of the merchant 
(such as a hired talent), along with the name and phone number of the business. This 
coupon message will not normally exceed a predefined duration, though different 
durations may be imposed for different types of business, membership entries, 
geographic regions, or other criteria. Typically the voice or audio coupon will not 
exceed 30 seconds in duration, more usually not exceed 1 5 seconds, and frequently will 
not exceed about five seconds. Some flexibility may be provided according to a 
minimum and/or maximum information that the merchant is entitled to record. 

In one embodiment, once a caller asks for the business, hears the audio coupon 
and gets connected to the merchant, the merchant is notified by the system that this is 
a Talk411 call and play the same coupon message to remind the merchant (or an 
employee of the merchant) about their promotion so as to avoid any ambiguity or 
problem associated with an uniformed employee contesting the caller's (now 
customer's) entitlement to the coupon offering. 

In one of the embodiments, providing for talk41 1 ratings, a simple cumulative 
rating that is compiled from the raw ratings of the users is delivered when the merchant 
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calls to check the rating. Security protections such as password or numerical code may 
be required to access such ratings, particularly when greater detail is provided to the 
merchant than to the general public. There may also desirably be a ranking within the 
category to which the business belongs. These ratings will likely be adjusted from time 
to time and evolve to suit the needs for each category. In one embodiment, there is a 
simple 1-10 rating, where 1 is the lowest rating and 10 is the highest rating. Other 
embodiments provide for callers to rate the business on multiple criteria as may be 
relevant to the category. 

It will be apparent to those workers having ordinary skill in the art that the 
inventive voice or speech based system provides numerous features, capabilities, and 
advantages over existing systems. The inventive system and method also provide for 
both separate Internet or world wide web based interface as well as to an integrated 
system providing both voice/telephone accessed and character/Internet based access. 
All, or nearly all, of the information described for the voice/telephone accessed features 
and capabilities are also available for access through a computer graphical user interface 
(GUI) over the web. It is not required to be able to interact with the web-based user 
interface with voice, or to connect the user to the merchant with the phone in this phase, 
however it is desirable to be able to play back the messages during an Internet 
web-based session that were recorded on the phone. It may even be desirable to allow 
a user to record a message using local available computer resources and upload the 
message to the system, however, this is not required. 

In other embodiments of the invention, the user is provided with the capability 
to select a business and connect to the business using the Internet with Voice Over 
Internet Protocol based technology. 

Desirably, the graphical user interface, provides a differentiation between a 
recorded message versus a text message prior to someone clicking the prompt, so the 
user expectation is set properly. 

An embodiment of the Internet web-based graphical user interface is now 
described. For reasons of simple access, the web pages are desirably sized and 
formatted to display on a single viewing screen without the need to scroll. 
Advantageously, this is accomplished independently of the user's currently set screen 
resolution. Intuitive buttons and menus are advantageously provided where access and 
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interaction is as simple as clicking on the available selections, and a single mouse click 
is used where possible. Quick page access is a high priority to encourage use without 
undue effort, therefore any graphics are desirably used judiciously only where 
necessary. 

Simple and clear icons, tabs, drop downs, and the like features are used. Some 
space may be reserved for ad banners and other content that may provide useful 
information for the user or a source of revenue for the site, site operator, or talk41 1 
provider. Coupons will desirably be highlighted so the user can easily understand that 
they are coupons, and provide facility for printing either as part of the web page or via 
a separate but readily available free program. 

Information storage, search, and retrieval are desirably fast and efficient to 
minimize any delays. Search and retrieval of business/merchant information such as 
name, phone number, location, address (any elements of the address) from any of the 
known data above. 

Search by category is supported, desirably with result priority given to category 
sponsors, and to those with coupons. Reserve a small section where BayHit businesses 
can be featured with a distinct graphic, linking to their business information. 

The result of a search will yield a short list of found or matched (or closest 
match) businesses with the business information for a predetermined number of 
(typically 2-5 and usually 3), category sponsors visible in most of the page. The user 
may then click and select others in the list if not satisfied with the sponsors found. Each 
result page served with the next button should desirably serve the same format as above, 
except changing the sponsors and the short list. 

For the business user, subscribed business users may be provided with one page 
where they can provide essential business information that will show as a result to a 
user search. They can select from an available selection of templates, upload their logo 
or graphics, and input their information. There will be a dedicated part for displaying 
their coupons if purchased. Custom pages not conforming to the template may be 
available as part of the standard subscription, available only to higher membership 
levels, or available at a higher fee. In one embodiment, customization is permitted but 
only within a certain range of options that preserve the talk41 1 look and feel. 
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A dedicated management page may optionally be made available for the 
subscribers. Subscribed business will be able to track their coupon promotions, 
category sponsorships, how they are rated in the category, and their account history. 
Other information may also be provided. 

There will desirably be a provision to provide call volume, page hit rate, and 
comparative business results for businesses in the same category. This feature may be 
enabled when the volumes reach predetermined levels. Based on account activity and 
ratings, the talk4 1 1 service provider or an agent thereof will publish recommended 
action promoting our category sponsorship and coupon opportunities. Therefore it is 
desirable though not required that some space dedicated to Talk411 be reserved on 
every merchant page. 

In one embodiment, a business, merchant, or organization are able to type in or 
otherwise enter or present the Coupon message or the Business information into the 
provided field which will then be translated into speech when a caller asks for the 
information over the telephone. Alternatively, Business/Merchant will be able to type 
in the information, and request the information to be recorded by a professional voice 
(or audio) talent from the list provided on the web site. This alternative feature is 
provided via an optional voice talent portal (VTP). Through this portal or other means, 
a user is able to select, such as by clicking on an icon or using a pull down menu, and 
hear a recorded clip of a message delivered by a registered voice talent as a preview. 
Each registered voice talent will have a private page where pending jobs will be posted 
and messages to be recorded displayed. Once the voice talent completes the recording 
of the job, he/she will be able to upload the wave file (or other file format) for the 
recording to their private page, and verify and publish it for review by the business. A 
phone alert with the recorded message will be made to the merchant or organization to 
confirm and publish the message. 

Upon completion with business approval, their credit card or other account will 
be charged or invoiced with a portion of it paid to the voice talent. Using the facilities 
available over the Internet, including the ability to use email and record, store and 
communicated high quality digital recordings, it is anticipated that such process should 
not take longer than 24-72 hours, and that for highly sought after talent, not more than 
one week. Additional fees and time may be required for premium talent. The ability 
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to provide a quantity or volume of messages to a voice talent is a key value for 
merchants and businesses who want professional recording of their messages and don't 
want to pay the full hourly fee for such talent, or invent the overhead associated with 
contacting and negotiating a contract with the talent or their agent. Using the inventive 
system and method, talk411 aggregates the messages to be recorded and lets the 
merchants share the cost of voice talent recordings. 

In one particular embodiment of the inventive Talk4 1 1 service, the design of the 
system is guided in part based on empirical results. These results suggest that the 
voice-recognition should desirably be highly accurate, preferably accurate better than 
95% of the time. Automated announcements should desirably be a compelling and use 
consumer tested voice. Response time to voice inputs should ideally be no more than 
about 1 second in most cases, and typically no more than 3 seconds when searching. 
Some fine tuning of the input and search timing will occur when the system is fielded. 
Callers should desirably be able to interrupt any time and the system and method 
should desirably be able to deal with this. Transactions and identities should desirably 
be kept secure by the high standards. For consumer calls, when the system does not 
understand after two tries, it should desirably bounce to a live operator, or take other 
action in an attempt to understand the consumer. For example, if a live operator is not 
available, the system may use a more sophisticated (but possibly slower) processing 
algorithm to understand the speech. For business subscriber calls, when they say, 
"help" there should desirably be some automated help, and when they say "save me" 
they should desirably be connected to a customer service representative. 

Various hardware and software configurations, computer program code 
constructs, procedures, and mechanisms, as well as a variety of different operating 
scenarios and protocols for accomplishing these and other goals are now described. 

Having now described embodiments of the overall system as well as several 
operating scenarios, attention is now directed toward a description of two particular 
embodiments, an Internet web based embodiment and a voice processing based 
embodiment. Attention is first directed to a description of the voice processing based 
system including the voice processing hardware and software. 
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Voice Recognition Based System And Method 

FIG. 1 shows the top or high level system architecture 102 and consumer user 
(or caller) 101 access points for a preferred embodiment of the invention. A cellular 
phone 106 (or other wireless device), standard telephone 118, either an analog (POTS) 
or digital, can be connected to the architecture 102 using a standard telecommunication 
link 120, such as a standard telephone line 122, ISDN line 124, cable 126 or DSL line 
128. For a cellular phone 106, it is understood that there is a cellular base station 108 
and a cellular switch 110 interposed between the cellular telephone 106 and the PSTN 
112. The incoming call 130 from the cell phone 1 06 or the telephone 1 1 8 goes through 
a PSTN 1 12 and telephone switch 1 14 and gets picked up by the Speech server 116, 
which is connected on one side 1 17 to the telephone switch 115, and on the other side 
1 19 to a computer network 130 such as for example the Internet. 

Access may be via the Internet, for example access using wireless devices using 
or compatible with the Wireless Access Protocol (WAP). Data may therefore be 
communicated in a WAP compatible format. WAP is a standard set of protocols for 
wireless Internet access which can deliver special Web pages to portable devices such 
as smartphones and palm-portables. In current versions it relies on the use of WML 
instead of HTML and so on. These systems generally support Java and Windows CE, 
among others. WAP Version 1.0 has been released and is hereby incorporated by 
reference along with variants and extensions thereof. Other versions are under 
development. 

The user (or caller) 101 can access the information or data 132 that resides in 
a database 134 within the Information Center 136 and the Web Site 138 through 
interactive voice commands 140 and/or through keypad presses 142 on the caller's 101 
device, such as on the cellular telephone 106 or standard wired telephone 1 18. In a 
preferred embodiment of the invention, only the caller's voice commands are used. The 
caller's voice commands 140 are recognized and translated into one of the variations of 
Voice Extensible Markup Language (VXML, VoiceXML, or VOXML) commands 144 
by the Speech Server 116 using a speech-to-text conversion engine 146 and once 
translated into VXML are used to retrieve the information 132 from the Information 
Center 136 database 1 34. VXML is an extension or elaboration on the XML (Extensible 
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Markup Language) standard known to workers in the art and not described in greater 
detail here. Information concerning the VXML Forum is available on the world-wide- 
web at http://www.vxmlforum.org/ and Version 1.0 of the VoiceXML specification 
dated 07 March 2000 which is hereby incorporated by reference is available in Adobe 
Acrobat format at http://www,vxmlfomm.org/specs/VoiceXML-100.pdf . 

Once the data 132 is retrieved and transmitted back to the Speech Server 116, 
the text information from the data 132 is converted to speech using a text-to-speech 
conversion engine 148 within the speech server 116 and played back to the caller 101 
using the caller's device 106, 118. Speech server 1 16 also generates and plays back 
(presents) pre-recorded or synthesized menu commands 150 to the caller. The system 
architecture connects 102 the information database 134 to the Internet 130 (or other 
local or global network of computers and/or information appliances) which can also be 
accessed with a display device 152 such as a personal computer (PC) equipped with a 
modem 154 (wired or wireless), a smart phone 156, other wireless phones or devices, 
a PDA or palmtop device 1 5 8 or any computer or other information appliance or device 
that can be connected to the Internet (or other local or global network) with the ability 
to display standard Hypertext Markup Language (HTML) pages or other formats 
interpretable by the computer 152. Access to such wireless devices may use the 
Wireless Access Protocol (WAP) and/or other protocols or standards. 

It is noted that although reference is made to several current industry standard 
data and information formats and protocols, such as HTML, XML, and VXML, the 
inventive structure and method are not limited to these particular formats and/or 
protocols or to the versions of these protocols in existence at the time the invention was 
made as those workers having ordinary skill in the art will appreciate the capabilities 
and features provided by these formats and protocols may be provided in other ways 
and that future versions of these formats and protocols will also support the inventive 
structure and method. 

Embodiments of the inventive system may desirably incorporate and utilize 
natural language speech recognition. In such implementations, the user can naturally 
speak and the system interprets the user's speech to extract the request or inquiry. The 
provides additional flexibility for a user as that user does not need to know any 
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particular commands or request rules or syntax. Natural speech processing and 
artificial intelligence are known in the art and not described in greater detail here. 

The system and/or method described herein may be deployed as a separate 
installation or deployed through application service providers (ASPs) who have the 
hardware and software deployed either nationally, or in the local areas. Teleron and 
Voiceeo are examples of such providers. 

FIG. 2 shows an embodiment of Speech Server 116, some of its functions and 
functional connectivity to receive a switched telephone call and to interact with the 
Internet 130. Speech Server 116 performs several tasks such as the task of providing 
a Network Interface 160 to the analog or digital phone network that provides the 
switched phone call 162, Automatic Speech Recognition (ASR) 146 or speech-to-text 
conversion (STT), Text- to- Speech conversion (TTS) 148, runs the application or 
application program 164 that control and manages the phone calls 162 and the 
Interactive Voice Response (IVR) 166. IVR refers to the interactive voice response 
which is conventionally a menu driven response provided in response to an input. A 
user is asked to say something (for example, "Press or say 1 for marketing, press or say 
2 for research", etc.) However, the inventors are not aware of any such conventional 
systems that provide ASR or text-to-speech in connection with IVR. In one 
embodiment of the invention, the Speech Server 1 16 is a personal computer equipped 
with Dialogic Antares automatic speech recognition boards and other products. 
Information regarding the Dialogic Antares boards are available from Dialogic 
Corporation, 1515 Route Ten, Parsippany, NJ 07054-4596 USA and on their web site 
at http://www.dialogic.com/products/indx_abp.htm. 

Operation of the exemplary Speech Server in the system is now described. The 
incoming call 1 62 is answered by a network interface card 1 60, such as for example a 
Dialogic network interface card (analog or digital). A prompt is played to the caller 101 
over the caller' s device 1 06, 1 1 8 asking the caller to say the selected item 1 70 from the 
available selections on a voice or audio menu. When the caller responds to the request, 
the application 1 64 passes the voice data to the auto speech recognition block 146, such 
as may be provided by a Dialogic Antares™ board loaded with an Automatic Speech 
Recognition (ASR) software. ASR software is available from several sources, 
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including for example from Lernout & Hauspie (L&H) (LERNOUT & HAUSPIE 
Burlington, MA, Phone: 1-781-203-5000, Fax: 1-781-238-0986, http://www.lhs.com) 
or SpeechWorks (Speech Works International, Inc., 695 Atlantic Avenue, Boston, MA 
02111, Tel: 617.428.4444, Fax: 617.428.1122, http://www.speechworks.com). 

A Dialogic Antares board-based automatic speech recognizer (speech-to-text) 
146 translates the voice data into ASCII text (or another code or symbols) that identifies 
the spoken words and returns a text or other symbolic representation of the results to 
the application 164. The application 164 accesses, via for example a T-l line or faster 
Internet connection, the database 1 34 of the Information Center 136. Real-time (or near- 
real-time), active vocabularies are generated at run-time using the database's 1 34 ASCII 
text or symbols. The application uses the ASCII text from the database 134, passes it 
to a second Antares board 148 running a text-to-speech (TTS) algorithm. The TTS 
algorithm generates the final voice or audio information that is played to the caller 101 . 

Voice processing hardware is now described relative to one practical 
embodiment of the invention, using a Dialogic Antares D41E printed circuit card 
configured for a Microsoft Windows based computer that supports up to four analog 
telephone lines and provides a telephony interface and voice processing functions such 
as playing and recording speech messages and collecting or dialing DTMF digits. The 
embodiment also utilizes a Dialogic Antares 2000/50 platform is used in one 
embodiment, and allows the Nuance Voice recognition system (or other voice 
recognition system with appropriate modification) to operate properly on the Dialogic 
platform by providing echo cancellation and end pointing services for the system). The 
installed card will support up to 16 channels of voice recognition. 

The Antares 2000 family platform provides a standalone open digital signal 
processor platform for telecommunication applications. The Antares hardware platform 
provides four independent TI TMS320C31 32-bit floating-point DSPs running at 50 
MHz, each with high-speed SRAM, enabling algorithms to be easily ported to the 
Antares™ board. Multiple memory options provide flexible solutions for different 
applications. For example, 5 12 KB (128 K words) or 2 MB (512 K words) local SRAM 
are available per DSP and 4 MB (1 M word) or 8 MB (2 M word) global DRAM 
memory are available per board. SCbus™ (or PEB™) connectivity allows standard 
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access to off-the-shelf call processing products and provides the capability to build 
higher density systems. Up to 32-channel capability is available. Drivers are presently 
available for MS-DOS®, UNIX® (SCO®, UnixWare™, Solaris®, and ADC®), OS/2®, 
and Windows NT®. 

The Antares platform open development environment uses the Antares board 
and an ANSI C Compiler and Assembler/Linker for the DSPs. SPOX support includes 
the run-time SPOX DSP operating system and SPOX-KNL kernel development kit. 
Additional software includes the Antares kernel , which integrates the DSPs into the 
SCbus system and a down loader to download software to the board. 

A single Antares board can process up to 32 channels of digital voice and digital 
telephony information received from a network interface board via the SCbus. The 
SCbus bit stream from the network interface board can contain both audio (digitized 
voice) and telephony signaling information. The bit stream is applied to a SC2000 ASIC 
chip which has an internal switching matrix. Under firmware control, the SC2000 chip 
can connect any external bus time slot (1024 for the SCbus or 32 for the PEB) to any 
TDM (time division multiplexed) bus time slot on the Antares board. The Antares 
board uses four C3 1 DSPs, each with its own Static RAM (SRAM) to separate and 
process data for any assigned time slot. Each DSP processes data based on downloaded 
firmware stored in its SRAM. After processing the incoming data, each DSP 
communicates with the host PC via global Dynamic RAM (DRAM). Kernel services 
are available on every DSP. The Antares local kernel provides all the necessary tools 
to read and write data to and from the TDM bus, which operates as a PCM (pulse code 
modulated) highway. By using the DSP's internal DMA (direct memory address) 
capability, the kernel can bring multiple frames of PCM data (up to 32 time slots each) 
into DSP SRAM from the TDM bus. Each DSP connects to the local TDM bus via a 
serial interface. The TDM bus is implemented with 32 time slots that are mapped into 
1024 external SCbus time slots by the SC2000 chip. Each SRAM contains the 
downloaded firmware that controls DSP processing and also provides the DSP with 
dedicated RAM for its data processing operations. 

A control port on the Antares platform accepts operational commands from the 
host PC and monitors DSP-to-host PC interrupt status. This control port controls 
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Antares platform operations such as: DSP reset; communications with and firmware 
downloading from the host PC via shared global DRAM read/write operations; the 
transmission of interrupts, control, and status information to and from the Antares 
platform via the global resource circuits. When the system is initialized, firmware to 
control each DSP's processing is downloaded from the host PC to the DSP's SRAM. 
This downloadable firmware enables feature enhancement, application changes, and 
upgrades. 

Additional information on this item as well as on other products that may be 
used in conjunction with the inventive system may be obtained from Dialogic 
Corporation is an Intel company, with world headquarters at 1515 Route Ten, 
Parsippany, NJ 07054-4596 USA. 

The above described Dialogic hardware and the computer system in which it is 
installed, inter-operates with an operating system (such as for example, a Microsoft 
Windows operating system, a Linux operating system, a Unix operating system) and 
application program software including device drivers. 

In one embodiment, the Talk411 system is an integration of commercially 
available software applications and custom applications. These commercial software 
and custom software components include operation under a Windows NT Workstation 
(v4.0 Service Patch 4) operating system environment which is available from Microsoft 
Corporation of Redmond, Washington, USA. The software may alternatively be 
adopted to interoperate with any of the other commercially available operating systems 
such as earlier and later versions of Microsoft Windows, such as for example Windows 
2000 Professional; or non- Windows operating systems such as Linux, Unix, and others 
as are known in the art. 

The core voice processing application is Parity software's VOS Version 6W 
(Version 6 for Windows). Parity VOS (V6W dated 07-1999) is available from Parity 
Software, Three Harbor Drive, Suite 1 10, Sausalito, CA 94965. Parity Software is a 
Dialogic company which is owned by Intel Corporation of Santa Clara, California. 
Parity Active Data Objects RLL (ADO RLL) is also used and is available from Parity 
Software, Three Harbor Drive, Suite 110, Sausalito, CA 94965. The Nuance voice 
recognition support the voice recognition requirements. Nuance Speech Recognition 
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Software (v6.2.1) is available from Nuance, 1005 Hamilton Court, Menlo Park, CA 
94025 . A VOS-Nuance integration Run-Time Link Library (RLL) provides an interface 
between the Parity VOS software and the Nuance speech recognition system. An 
embodiment of a VOS-Nuance Integration Run-Time Link Library (RLL) is available 
from TeleVoice, Inc., 1 1767 Katy Freeway - Suite 425, Houston, Texas 77079. These 
software and system features are described in greater detail below. 

The Parity VOS system includes a VOS Script file or files which have been 
compiled to binary code which can be executed by the VOS runtime-engine. In Version 
6W, VOS licenses are required both to run the system (a run time license) and to 
compile the VOS script (a developer license), both of which are available from Parity 
Software. (A developer license can be used as a run-time license but a run-time license 
cannot be used as a developer license.) Licenses are controlled by a hardware-key 
("software sentinel" or "dongle") that plugs into the parallel port of the system. 

Additional functionality to the basic Parity VOS script language may be 
achieved by external modules known as Run-Time Link Libraries or RLLs. Parity 
Software provides some RLLs with Parity VOS and other RLLs which may be desired 
to implement additional features are available from third-party vendors. RLLs are used 
for example, to enhance the VOS capabilities to include such features as voice 
recognition, text-to-speech, access to ODBC-compliant databases and convert numeric, 
date and currency values to speech. 

The application specific program modules used for the inventive Talk411 
system and method include a Nuance voice recognition interface module and a 
SpeechPro module which converts numeric, date and currency values to speech. Other 
providers of the broad class of text-to-speech converters (of which numerals-to-speech 
conversion is a simple subset) are known in the art. For example, Parity VOS provides 
a similar RLL called SmoothTalker with the VOS application. Those workers having 
ordinary skill in the art will appreciate that there are a number of competitive 
commercial products that may be utilized and that the rapid development in this 
technology area will likely result in a proliferation of software and hardware products 
that may be used in conjunction with the inventive system and method. 
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In Windows environments (such as in Microsoft Windows NT, Windows 2000, 
or the like environments) , RLLs take the form of DLLs or Dynamic Link Libraries. 
RLLs are normally specified in a file (such as for example in a RLLS. TXT file) which 
must be referenced both when compiling and when executing. These programming and 
compilation conventions and procedures are known in the art and not described in 
greater detail herein. 

By convention, script files used by the application are designated by a vs" (vos 
script) or ".vh" extension, and compiled VOS scripts are designated by a ".vx" 
extension. Exemplary embodiments of selected ones of the files are summarized below. 
Additional detail is available in programming manuals and user documentation from 
the vendor, either by mail at the address provided elsewhere herein or via an Internet 
web site provided by each of the vendors. 
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Table I. Exemplary VOS Configuration Files 



(Note that ".def 1 files are the source files, ".par" files are the compiled version of the ".def ' file) 
cpb.def Stores Dialogic specific configuration information (Channel 

Parameter Block) 

CPB.PAR Compiled cpb.def file 

hedge=2 Detect greeting on trailing edge for outbound calls 

nbrdna=6 Rings before no answer signal given on outbound 

intflg=8 Specifies to use full Call Progress Analysis for outbound calls 

init.def Stores VOS initialization settings 

INIT.PAR Compiled VOS initialization settings 

sc_pcpa=l Use Call Progress Analysis 

sc_words=20 Maximum words in a phrase (used by SpeechPro) 

sc gtdterm- 1 Let hang-up tones terminate any voice functions 

fil_buf=l 6 File copy buffer size (in Kilobytes) 

pr.def Defines indexed speech files 

PR.PAR Compiled pr.def file 

tone.def Stores PBX specific Tone configurations (generated by Dialogic 
PBXpert prog.) 

TONE. PAR Compiled tone.def 



35 



10 



. r-s 
IB 

fU 

- EES 



15 



20 



A-69342, 



-37 



Parity VOS configuration files are listed and briefly described in Table I. Parity 
VOS Source and Executable code files are listed and briefly described in Table II. 
Table III lists and describes various Other VOS related files. Additional description is 
provided relative to files and procedures that have been developed to support the 
inventive system and method and are not part of the commercial software programs. 

It is noted that although Parity VOS is used in the particular exemplary 
embodiment, the inventive system and method are not so limited and that those workers 
having ordinary skill in the art will appreciate that other programming language and 
methodological constructs may be used to achieve the invention, such as for example 
implementations using VoiceXML, C++, or other scripting or programming language 
or languages. 



Table II. Aspects of VOS Source and Executable code 



(Note that ".vs" and ".vh ,! are source code, and ".vx" is executable code) 

master. vs This file "kicks off 1 the Talk41 1 application for each voice line 

master. vx Compiled "start" file 

Spro.vh Header file for SpeechPro commands and constants 

spro.vs VOS source code for SpeechPro functions 

talk41 l.vs This is the main application file 

talk4 1 1 . vx Compiled main application file 

VrCmd.vh Header file for VR commands and constants 



Table III. Other VOS Related Files 


directory2.mdb 


Microsoft Database file used to store demo business info. 


LocalRlls.txt 


File used to specify VOS RLLs for compilation 


Rlls.txt 


File used to specify VOS RLLs for Run-Time 
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The master.vs file is the file specified when launching the Parity VOS run-time 
engine. It contains all of the code necessary to open up the database, initialize and 
configure the voice lines and the voice recognition channels and spawn or launch the 
Talk411 application for each voice line. The presence and general structure of 
5 master.vs (and it's compiled partner master. vx) are known to workers using Parity VOS 

and other such commercial programs. Master.vs for the talk41 1 system is a file that 
does some set up, such as, determining how many phone lines are installed, and 
launches one talk41 l.vs application for each of the installed lines. It is somewhat like 
a computer system startup file and initializes some variables, establishes connection to 
10 the database that stores the business/merchant/organization information that will be 

retrieved in response to caller's requests, and then spawns one copy of talk41 l.vx for 
each line that it detected as being installed. Master.vs is specific to the talk411 
M* application but it is generic in that each application would usually have a master, 

ry Several of the specific tasks performed by master are now described in somewhat 

!Lf 15 greater detail. 

*L Master, vx first determines and defines the number of voice lines installed. This 

W represents the number of simultaneous callers (usually use function that detects number 

p of hardware channels of lines installed, but where the number is stable, this may be hard 

coded). (Restrictions may also be applied where the number of lines is greater than the 
20 number of software licenses.) 

Master. vx then opens a connection to the database in master that is shared by 
the applications. This database can be a local database either within the same physical 
box, co-located in a separate box or device, or located remotely. In one embodiment, 
the database is a Microsoft Access (Access 97) compatible database, however other 
25 database types and formats may be used. Advantageously, Parity VOS includes 

database functionality, so that a database created in Access may be used by VOS 
without requiring Microsoft Access or separate database application program. In one 
embodiment, the same data base file (dictionary32.mdb) is used to store and retrieve all 
of the business, merchant, individual, or other organizational material for both the voice 
30 recognition based talk41 1 and the Internet based talk41 1 . 
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Master. vx code instructions then initialize the Nuance voice recognition that sets 
up the mapping between the inbound call line and the Nuance resources. It also maps 
(or sets up an association or routing of) the Nuance processing channels to the voice 
caller channels. 

Master. vx then spawns or launches one copy of talk4 1 1 . vx application for every 
line installed (subject to the number of licenses). There is an iterative loop that 
increments for the number of lines available. Operation then continues under the 
control of each talk41 l.vx compiled application. 

The Talk4 1 1 . vs script defines how the actual call handling and processing 
happens. The general flow of the call from the talk41 1 code perspective is as follows. 
First, a code procedure module "InitializeO" sets up the telephone line or 
communication channel specific parameters and prepares the line or channel to accept 
a call. This line or channel may be wired or wireless and so the term line or channel is 
used in its broadest sense. Next, a "WaitForCall()" procedure waits for a ring signal 
on the line and when it arrives, takes the line off-hook and prepares it for processing. 
An "Initialize Voice() M procedure module sets up the voice recognition parameters for 
this line and prepares for a recognition. 

A "MainMenu()" procedure is where the call originates after the above 
initialization. At this point we set the Nuance Engine to the " .MainMenu" grammar and 
wait for a result. It returns a series of "slots" each with a valid keyword. The call is 
then processed based on the result of this recognition. If the caller said the name of a 
business, we give them the business information. For example, if the caller spoke a 
"Type" of business, the caller is provided with information on business of that type. 
Alternatively, the caller may either have spoken a command (such as help, quit, member 
information, or the like), in which case the command is handled, or the spoken utterance 
may not have matched any options in the grammar. Failure to match may be due, for 
example to the intended spoken utterance not being defined in the grammar, or the 
quality of the received utterance may have precluded matching. Various programmatic 
options are available requesting the caller to re say the utterance or to provide a 
different command or utterance. The talk411.vx file is the compiled version of the 
talk411.vsfile. 
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A exemplary embodiment of the talk41 l.vs is now described. The talk41 l.vs 
script file and its compiled version talk4 1 1 .vx is application program that controls how 
a telephone call to the talk41 1 service is processed. An interface exists which permits 
signals sent from any telephone handset or other telephony capable device to be 
received and recognized by a computer or more precisely by a circuit or PC board 
designed to handle and process such signals. Telephony-to-computer and computer-to- 
telephony products made by Dialogic that provide this signaling and call processing 
capability are described elsewhere herein. In the recent past, interaction between 
telephone systems and computers primarily relied on touch tone frequency signaling, 
such as may be accomplished by pressing a series telephone keypad buttons to generate 
readily recognizable signals, or equivalently to have a computer program generate such 
tones. More recently, new technology supports spoken voice command and data in a 
system that provides voice recognition to interpret the electronic signal generated from 
the spoken voice, interpret that signal, and convert that interpreted signal into one or 
more commands or data items that can be processed by the computer. Effectively, voice 
is used to give or respond to an inquiry. These hardware and voice recognition 
components are known in the art and therefore are not described here in greater detail. 
References are also provided to companies and organizations providing this type of 
technology as standard off-the-shelf commercial products. 

The talk411.vs, talk411.vx, and other associated or related files provide an 
additional layer or layers on top of the conventional telephony hardware and software, 
the voice recognition software and possible optional hardware or DSPs (voice-to- 
computer), and computer-to-voice (also referred to as text-to-voice), that direct the 
system in handling a call in a particular way and to provide particular features. The 
inventive software and system configuration used in conjunction with existing or 
customized voice recognition features constitute a novel computing and information 
processing machine and method. 

As stated elsewhere in this specification, the Parity Software VOS Version 6 for 
Microsoft Windows is used as a foundation platform. In its native uncustomized form, 
it supports only touch tone signaling, and does not support voice processing or 
recognition in its native form. Nuance has provided a voice recognition software 
(Version 6.2.1) available from Nuance at www.nuance.com . but this software is not 
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directly compatible with the Parity VOS software. Therefore, a third major software 
component integrates the Nuance voice recognition software with the Parity VOS 
software so that the three components accept voice or speech input, interpret it, create 
a character or a string of characters, and provide that character of character string in the 
manner of one or more computer keystrokes or other digital data stream that is readily 
interpretable by the computer. 

More specifically, the Parity VOS software handles the interface to the phone 
system (including how to answer a call when a phone ring indicator signal appears, how 
to play messages to the caller after it answers, and how to accept touch tone signals 
form the caller. It also handles branching to further alternative processing based on 
what action is selected by user, and handles hang-up or call termination by hanging up 
the line, and resetting the line and the system and software to receive the next call. In 
essence is it responsible for call control. In many applications it is the only application 
software that is needed, particularly if the system is a touch tone based systems. The 
Parity VOS software version 6W does not have native voice recognition built in. Parity 
VOS version 7 is available, and it is anticipated that Version 8 will be released shortly. 
Version 8 may provide some additional voice processing or recognition capability, or 
provide an interface or other integration with Nuance Voice Recognition (and possibly 
including others, such as Speech Works software). The interface software is presently 
a piece of software that permits use of Nuance Voice Recognition software and returns 
a result from that software to VOS for processing. 

Parity Software also provides a developers toolkit in the form of a programing 
language and compiler for that language, so that software code, including data 
structures and executable instructions, can be developed to tell the VOS software and 
other components of the system what to do. These developer tool kits and 
documentation for their use are available from Parity Software and are not described in 
greater detail here. The talk411.vs script is developed using this VOS toolkit and 
compiled into talk411.vx. This script and its compiled version executes within the 
hardware to provide control the particular manner in which call flow is handled, what 
inquiries and responses are permitted (in conjunction with various dictionary and 
grammar filed described elsewhere herein), how to interpret specif commands and 
situations. In essence, the talk4 1 1 . vs and talk4 1 1 . vx files control the life of a call. Note 
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that multiple phone lines and interactions on a shared personal computer or server are 
permitted. In one embodiment, twenty- four simultaneous calls on twenty- four different 
lines are supported. With additional hardware boards and an appropriate configured 
computer, additional lines and calls are supported. Portions of an exemplary talk4 1 1 . vs 
file are set forth in the specification. Selected components of the talk41 l.vs script file 
are listed and described and it is expected that those workers having ordinary skill in 
the art in light of the description provided herein, including the description of the 
function and operation, the commercial availability of Dialogic, Parity and Nuance 
Software and Hardware, will understand how particular aspects of call handling are 
implemented without detailed exposition of all portions of the script, dictionary, and/or 
grammar source or executable files. 

The exemplary talk4 1 1 . vs file includes the following components or functions: 
main menu {MainMenu()} , business information {BusinessInfo()} , connect to business 
{ConnectToBusiness()}, more business information {MoreBusinessInfo()}, recommend 
others {RecommendOthers()}, type menu {TypeMenu()} 9 modifier menu 
{ModifierMenu()}, rate business {RateBusiness()}, member entry {MemberEntry()}, 
play promotional message {PlayPromo(ID)} , play coupon message {PlayCoupon(ID)} , 
"record promotional message {RecordPromoMessage()} 5 purchase coupons 
{PurchaseCoupons()} , record coupon message {RecordCouponMessage()} , check score 
{CheckScore()}, category sponsor { Category Sponsor()}, get credit card information 
{GetCreditCard()}, confirm information {ConfirmQ}, get voice response 
{GetVoiceResponse(Grammar, FirstPrompt, SecondPrompt)}, parse input string 
{ParseString(InputString)}, process slot {ProcessSlot()} ? remove input string spaces 
{(RemoveSpaces(InputString) } , remove input string dashes 
{RemoveDash(InputString)} , remove input string parenthesis 
{RemoveParens(InputString)}, save recording {SaveRecordingO}, set end time in 
seconds {SetEndSeconds(seconds)}, initialize {Initialize()}, wait for incoming call 
{WaitForCall()}, initialize voice {InitializeVoice()}, terminate voice 
{TerminateVoice()}, play message {Play(MessageName)}, play message b 
{Playb(MessageName)}, speech process the phone number {SproPhone(Number)}, 
and, play speech message that can be interpreted by the caller {Playl(Prompt)}. 
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These functions are largely self explanatory by their name. For example, the 
MainMenu() function lets the caller select a business, a type of business or enter the 
member area. The BusinessInfo() function reads the business information from the 
database and speaks the business Name, Phone Number, and any optional features such 
as the BayHit Message, Promotional Message, and Coupon Message. The 
ModifierMenu() menu function is used if the caller only said a modifier such as 
Coupons or BayHits. The Playl(Prompt) function will play a speech message that can 
be interrupted by the caller speaking, where the result of their speech is thrown away, 
and where the function returns true if the message was not interrupted and false if the 
message was interrupted. 

By way of further example, the MainMenu() function portion of the talk41 1 .vs 
file is duplicated in Table IV. Exemplary portions of talk41 1. grammar code are also 
providd in Tables V and VI. As will be appreciated from the syntax of the MainMenu 
function in the table, this function lets the caller select a business, a type of business or 5 
entry to the member area. Variables used by the function are reset, a greeting is 
optionally played, and a voice response is received from the caller. A determination is 
then made as to whether they spook the name of a recognized business, or of a business 
type, and whether any business type modifiers were spoken. The function also monitors 
to determine if any commands were spoken, such as for example "Help", "Quit", or 
"Member". 

In particular, the Get VoiceResponse('\ MainMenu", Prompt, "MainMenu") 
syntax results in playing a prompt entitled "MainMenu" that speaks: "Thank you for 
calling talk 41 1, please speak the name of a business or a type of business and we will 
get the information for you." Note that in one embodiment of the invention, the words 
or phrases are played back from earlier recordings, and that in a different embodiment 
they are synthesized based on the words or text. Either approach may be utilized and 
some implementations may utilize a combination of the two. In general, the speech 
quality is better using pre-recorded speech and is preferred. 
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Table IV. Exemplary embodiment of Main Menu from talk41 l.vs script file. 

f unc MainMenu ( ) 

# = =i = = = = == SS = = = !SS = = = = = = = = = = = = = = = = =, = - = == = = = = - = = =:== = = = = = = = = = = = = = = = = = = = = = = — = = 

# This function lets the caller select a business, a type of business or 

# enter the member area 
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do 

voslog { "Line : Line, " Main Menu") ; 

# Reset the VR Variables before getting the next VR 
Business = " 11 / 

Modif ier=" " ; 

Type-" » ; 

Command = " " ; 

if (GreetingPlayed) 

Prompt = "MainMenu" ; 

else 

Prompt = "Greeting" ; 
GreetingPlayed = true; 

endif 

# Get a Voice Response from the caller 
GetVoiceResponse ( " .MainMenu" , Prompt, "MainMenu" ) ; 
voslog ( "Business=" , Business) ; 

voslog ( "Modif ier=" , Modifier) ; 
voslog ( "Type=" , Type) ; 
voslog ( "Command^ " , Command) ; 

# Did they speak the name of a business? 
if (Business strneq " " ) 

Businesslnf o ( ) ; 

else 

# Did they speak a business type? 
if (Type strneq "") 
TypeMenu ( ) ; 

else 

if (Modifier strneq "") 
Modif ierMenu ( ) ; 

else 

# Did they speak a command 
switch (Command) 

case "Help" : 

Play ("Help") ; 
case "Quit" : 

Play ("Goodbye") ; 

TerminateVoice ( ) ; 
case "Member" : 

MemberEntry ( ) ; 
default : 

Play ("Help") ; 

endswitch 

endif 

endif 

endif 

until (Command streq "Quit"); 
endf unc 
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Table V. Exemplary .MainMenu Grammar from talk41 1 .grammar Nuance Voice 
Recognition Component 



.MainMenu 
[ 

Business 
Type 

(Modifier Type) 
(Type Modifier) 
Modifier 
Commands 



Assuming the caller spoke something the system detect what they said. If they 
spoke the name of a business then the string returned would not be empty. If the string 
is not empty, it is processed as if the caller spoke the name of a business. If the type 
variable has been set, the system knows the caller spoke a type of business and we 
process that. The program and system parse through and analyze the results that come 
back from the voice recognition process (Nuance) and process it. Note that the voslog 
operation merely displays to a display screen. It is further noted that this exemplary 
main menu is taken from a simple example of a talk411 script file used for 
demonstration purposes. 

In connection with the get voice response (GetVoiceResponse) operation, the 
".mainmenu" parameter is a reference to a Nuance voice recognition grammar file that 
will be used in processing the voice response. When programming Nuance, Nuance 
requires the identification or definition of each symbol, word, number, or phrase that 
a caller may say in response to the prompt. Many or even most common words and 
numbers, as well as common phrases are defined in files provided by Nuance; however, 
current versions of the software do not allow unlimited words such as might be 
supported by desktop voice recognition applications. That additional level of flexibility 
and sophistication invariably require prior training by the speaker so that the speaker's 
voice characteristics are understood by the speech or voice processing software. 

Within the ".MainMenu" portion of the talk41 1. grammar file are six top-level 
grammars: Business, Type, Modifier Type, Type Modifier, Modifier, and Commands. 
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Anything with a capital letter as the first letter, such as capitalized "Type" means that 
the item or type is defined later on in the file or in another file. Further down in the 
same talk41 1. grammar file appears a block of code pertaining to further definition of 
the main menu "Type" top-level grammar. In what follows are a set of definitions, 
including definitions for such groups as attorneys, auto parts, and the like as indicated 
in Table V. 

With respect to the "<Type "Attorneys'^ in the right-hand column, there are 
three ways in this example that a caller might refer to attorneys, including "attorneys", 
"lawyers", or "ambulance chasers". Clearly other deserved or undeserved names might 
be included. These are all the ways someone might say attorneys. In similar manner, 
the <Type "Auto Repair & Service "> provides several different ways in which a caller 
may request this type of business, including: "auto repair", "auto service", "auto repair 
and service", "auto service and repair", "auto repair service", "auto service repair", and 
even "auto repair and repair", "auto repair repair", "auto service and service", or "auto 
service service" though the later four combinations are unlikely to occur. Note that in 
the Grammar Specification Language (GSL) in which the definitions are written, the 
square brackets "[...]" mean "or", the parentheses %..)" mean and, and the "?" mean 
that the item is optional, have meaning such as "and" or "or" which is part of the 
Nuance "grammar specification language" GSL. The "?" means that the term is 
optional. For example, one can merely say the word "hamburger" (as well as 
"hamburger restaurant") and the system will respond as if the caller said "hamburger 
restaurant". 

Note that the "Businesses" are all included in another file called 
"talk411businesses.grammer", but it is defined here (under the .MainMenu). The 
talk41 1 business. grammer is a file the contains all of the actual business names with 
variations on the way a caller may refer to a business. For example, a caller may speak 
either "Cisco" or "Cisco Systems" and mean the same business. 

The other elements such as "Modifier" are identified with or mapped in similar 
manner using GSL. Within "Modifier" the parenthetic lowercase "coupon" is the actual 
word coupon that is spoken. These are the things that can be used to modify business 
or type. One notes that in the main menu, provision is made such that a caller can say 
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"coupons for hamburger restaurants" and the system would extract both the modifier 
and the business type. Understanding the way people speak and providing the 
alternative expressions in the system is beneficial as it encourages individual consumers 
to use the system. 



Table VI. Exemplary "Type" grammar from talk4 1 1 .grammar file for Nuance 
Voice Recognition Component 



Type 



[ 



[attorneys lawyers (ambulance chasers)] { <Type 

"Attorneys " > } 

(auto parts) {<Type "Auto Parts" >} 

(auto [repair service] ?and ? [repair service]) { <Type "Auto 
Repair & service" >} 

(Chinese ?food ?Restaurants ) 

Restaurant 11 >} 

(hamburger ?Restaurants) 



Restaurant " > 



Restaurant " > 



Restaurant " > 



Restaurant " > 



(italian ?food PRestaurants ) 
(japanese ?food Restaurants) 
(mexican ?food PRestaurants ) 



{<Type "Chinese 

{ <Type " Hamburge r 

{ <Type "Italian 

{<Type "Japanese 

{<Type "Mexican 



( [physicians surgeons] ?and ? [physicians surgeons] ) {<Type 
"Physicians & Surgeons" >} 

(pizza ?Restaurants) {<Type "Pizza" >} 

([steak seafood] ?and ? [steak seafood] ?restaurants) {<Type 
"Steak Sc Seafood Restaurant " > } 

] 



The grammar files are ultimately compiled to create a grammar library that 
identifies all of the various permitted combinations and permutations that are matched 
to. One may appreciate that for a large vocabulary, this data structure can become quite 
complex. A library of spoken words and phrases are used that are used to match and 
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recognize the permitted words and expressions. In one embodiment, the libraries are 
built using the spoken expressions of human speakers. 

The talk41 1. dictionary file is a file that contains an explanation of words or 
phrases that are not in the Nuance engine. This file is needed when one compiles, 
however the file may be and frequently will have no entries. It contains the phonetic 
GSL description of the words or phrases that are not known by Nuance. 

The Nuance Voice Recognition System was developed using source files called 
"grammars' 1 which define the range of speech a caller can say in response to a given 
prompt. Like VOS scripts, once a grammar has been defined, it too must be compiled 
using the "compile-grammar" command. A Nuance license has heretofore been 
required on a per-port basis and the Nuance license has been controlled by a software 
key. 

Grammars may conveniently be characterized as either "Top-Level" grammars 
and "Sub" grammars. Top-Level grammars are initiated from the VOS script. Sub 
grammars are essentially the building blocks used to creating a Top-Level grammar. 
Each Top-Level grammar defines all of the allowable responses a caller can say in 
response to a voice prompt. A simple example of a top level grammar would be 
".Confirm". The ".Confirm" grammar may be used in response to a voice prompt such 
as "Is this correct?" The ".Confirm" would be constructed of two sub-grammars, "yes" 
and "no". Each sub-grammar would include all of the various ways a caller could say 
"yes" or "no". 

It is noted that these grammar files having all allowable responses may be seen 
as being somewhat limited given developments in speech processing and recognition. 
It should however be understood that the software and system are intended to recognize 
a response from some arbitrary speaker who may have never accessed the system before 
and has not trained the system or software with their voice. The circumstances may 
therefore differ from those present for various commercial voice or speech recognition 
products, such as those made by Lernout & Hauspie Speech Products N.V.; Flanders 
Languages Valley, 50; 8900 Leper, Belgium or Lernout & Hauspie Massachusetts 
Office, 52 Third Avenue, Burlington, MA 0 1 803 . International Business Machines also 
provides continuous speech recognition products. These products permit a user to train 
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the system to recognize virtually any words, and take into account not only the speaker's 
voice but also the microphone, memory, processor, and program application 
characteristics of the overall system. It may be expected that in time, the inventive 
system and method would be compatible with such as system; however, as the number 
of possible inquiries and responses are limited and may be known in advance, the use 
of such grammar files does not present any practical limitation to the invention. 

Grammar files are designated by a .grammar extension. At least two additional 
files are required before a grammar can be compiled: a .slot-definitions file and at least 
one .dictionary file. The .slot_definitions file defines slots for natural language 
recognition. The .dictionary file (or files), for example the talk41 1. dictionary file, 
defines words that are not in the Nuance grammar and thereby allows the total 
vocabulary which may be spoken and recognized to be extended. 

Embodiments of the inventive system and method provide the following five 
vocabulary related files. The "Talk41 1. dictionary" file defines words that are specific 
to the talk41 1 system and therefore not in the Nuance grammar. (It is noted that future 
versions of Nuance may be modified to include talk411 words. The 
"Talk41 1. grammar" defines the Talk41 1 application grammar. A "talk41 1. missing" 
file is a file that stores the result of the compile and indicates words not in the Nuance 
grammar. The "Talk41 l.slot definitions" file defines "slots" for the natural language 
recognitions. 

The talk41 1. grammar file is the main file that links any and all other grammar 
files provided by Nuance. Any file that is a ".grammar" file is defined by an include 
statement in the main talk4 1 1 .grammar file. Most of these .grammar files are provided 
by Nuance, such as grammars for obtaining and processing commonly used credit card 
expiration dates, through their commercially available tool kits. Note that in addition 
to the talk41 1 specific grammars, the main talk41 1. grammar file includes an initial set 
of include statements that link the other grammars. A portion of the code is provided 
in Table VII so that this grammar structure and relationship is clearly set forth. 

These "included" files handle, for example, common expressions, yes/no 
grammar, number grammar, thousands grammar, credit card number grammar, credit 
card expiration date grammar, and a larger talk41 1 business grammar that is a grammar 
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containing the talk41 1 business, merchant, and organization names. In this particular 
embodiment, all of the grammars except for the talk41 1 business grammar are provided 
by Nuance, and are common grammars that are used for many different voice 
recognition applications. 



m 
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Table VII. Exemplary talk41 1. grammar showing include statements to Nuance 
grammar files 



Talk411 Nuance Grammar Definition File 



Talk411 -Grammar 



# inc lude " common . grammar " 

#include "number . grammar" 

# inc lude " ye sno . grammar 11 

#include "thousands .grammar" 

# include " Credit CardNumber . grammar" 

# include "CreditCardExpiration . grammar" 

#include "Talk411Businesses . grammar" 

. MainMenu 



[ 



Business 
Type 

(Modifier Type) 
(Type Modifier) 
Modifier 
Commands 



. Type Menu 



Business 
Commands 

(more ?businesses) 



{ < Command " More " > } 



] 



. . . (additional items not shown here follow) 
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The "Talk41 1 Business, grammar" lists all of the business in the system. In one 
embodiment, this Talk41 1 Business. grammer is a dynamic grammar that can be added 
to or modified on the fly, that is as the system is up and running and being accessed by 
callers. In the other embodiments, such as developmental versions, this grammar is a 
static grammar meaning each business is added manually. In this manual version, the 
grammar is recompiled and the system restarted after making any changes to this file. 
An exemplary sample entry in the grammar is listed below: 

(straw hat pizza) {<Business "Straw Hat Pizza">} 

The entry on the left (straw hat pizza) is the actual phrase of set of words the caller must 
say or utter to select this business and the entry on the right is slot (Business) and the 
keyword (Straw Hat Pizza) that would be returned to the application for processing. 
Note that in this particular embodiment, the entry for the keyword must match the 
corresponding entry in the database exactly for a match to occur. Also recall that entries 
in the database and in the grammar may be established for variations of the phrase. For 
example, there may be an additional entry for "Straw Hat" that does not include the 
word "Pizza". Therefore the requirement for exact matching is not unreasonable given 
that all of the possible variations that may match are made available. 

The talk41 1 business are just a list of business, such as straw hat pizza, Michael 
Italian foods, or other business names. These are just examples, and the names are put 
in a separate file for convenient administration. Ultimately, a business will be able to 
sign up without any human intervention using technology currently being developed. 
Under these systems, it will merely be a matter of the user typing in the name, and 
allowing the system to process it. If it can be recognized without help that will suffice, 
however in some instances it is anticipated that a user may have to say the new name 
one or more times. It is also possible to specify the phonetic characteristics using GSL 
or other phonetic descriptive language, however this skill would not generally be 
available to lay people using the system. Nuance also provides for "dynamic 
grammars" that permits adding entries to a running application on the fly. In this way, 
business subscription or membership may be implemented automatically without 
requiring system restart or the like. Nuance and Talk41 1 grammar files are listed with 
a description as to their function in Table VIII. 
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Table VIII. Nuance and Talk 411 Grammar Files 


common.grammar 


Nuance provided grammar of common expressions 


CreditCardExpiration. grammar Nuance provided grammar for credit card expiration 


CreditCardNumber. grammar 


Nuance provided grammar for credit card numbers 


MAKEGRAM.BAT 


Batch file for compiling a grammar 


number.grammar 


Nuance provided grammar for numbers 


Talk4 1 1 .dictionary 


Talk41 1 dictionary file defines words not in the Nuance 




grammar 


Talk4 1 1 .grammar 


Talk411 application grammar 


talk41 1. missing 


Result of compile that shows words not in the Nuance 




grammar 


Talk41 l.slot_defmitions 


Defines "slots" for natural language recognitions 


Talk41 1 Businesses. grammar 


Talk41 1 business 


thousands.grammar 


Nuance provided grammar for numbers in the thousands 


yesno. grammar 


Nuance provided grammar for yes/no responses 



The inventive system and method also utilize a database file that stores 
Customer or Business information. In one embodiment, the database file 
(directory2.mdb) is created as a Microsoft Access 97 database file. Microsoft access 
was selected for this embodiment, as the other program modules making use of the 
database are able to store and retrieve information from the file without having Access 
program running. Other database structures or programs may alternatively be utilized. 

In one embodiment, the database named "directory2.mdb" includes a table 
named "Customers" and Field Names: MemberlD, CompanyName, PhoneNumber, 
Address, City, Region, Category, Specialty, BayHitScore, BayHitRates, Coupons, and 
CouponMessage. These fields include entries which are clearly optional in character 
and pertain to certain specific features provided by the inventive system and method. 
For example, the BayHitScore, BayHitRates, Coupons, and CouponMessage fields 
provide novel informational features not present in existing systems or methods. In 
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fact, nearly any database fields may be provided, it is merely a matter of identifying the 
type of information or content that is to be provided in response to a caller. It is 
anticipated that these informational or content items will include some identification 
for the business or entity (company name), telephone contact information (phone 
number), and address information (city and street address). It may also be desirable to 
provide some regional information separate from the city and street address (regions), 
and such region information may define a local neighborhood, or a multi-city or 
metropolitan area. Driving instructions from key recognized points of interest or 
locations may also be provided. Specialty may permit a member business to identify 
a particular product for which they are noted. For example, an Italian Restaurant may 
wish to identify themselves as specializing in Italian Vegetarian Cuisine, a night club 
as specializing in swing dance music, or a book store as specializing in first editions. 
There are a myriad of other fields that could be provided in the database and for 
response. 

It is noted that in one embodiment, the same database is used for the voice 
accessed and for the Internet accessed system. In the voice accessed system and 
method, the information may necessarily be limited to a set of words, short phrases, or 
expressions to simplify the voice recognition and text-to speech processes. It is also 
generally true that individuals accessing the inventive system from a telephone would 
prefer short to the point answers or information rather than lengthy explanations or 
pitches, particularly if such access is made while driving from a mobile telephone. 
Therefore, a single database may include entries that are common to voice and Internet 
accessed systems and those that are not common. For example, an additional set of 
fields may be provided that are served only when accessing over the Internet. 
Alternatively, two different databases may be provided which are optionally 
synchronized. One embodiment, maintains separate databases that are regional in 
character with the concept that a caller in New York City would not need to access the 
information for businesses or organizations in San Francisco. 

Such a database is defined in a memory storage device (either in solid state 
memory or bulk storage device) on a computer system. The computer may for example 
be a server computer, many types of which are known in the art. The computer and any 
attached storage may be on the same machine or on a different machine. Structures and 
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methods for building, maintaining, and accessing databases are known in the art and not 
described in greater detail here. Several fields for an exemplary customer data are listed 
in Table IX. These fields include the Company name and phone number, as well as 
optional but advantageously provided data fields for address, city, region, category, 
5 specialty, bay hit score, bay hits rating, coupons, coupon message. A member ID is also 

provided for accounting and identification purposes. 



Table IX. Exemplary Customer Database Fields 
Field Names: MemberlD 

CompanyName 

PhoneNumber 

Address 

City 

Region 

Category 

Specialty 

BayHitScore 

BayHitRates 

Coupons 

CouponMessage 



Having described the software modules and procedures present in an 
embodiment of the inventive system, attention is now directed to operation of the 
25 system. In order for the system to be started, each software module must be launched 

in the proper order. In one embodiment, the order for launching and a description of 
each module is listed below. It is noted that this ordering applies the particular scripts, 
compiled scripts, programs, or procedures as they presently exist and that variations in 
this order may be anticipated for a different set of scripts, compiled scripts, programs, 
30 or procedures. 

First, the "license. exe" is launched. This is the Nuance license engine that must 
be launched with a valid "license.txt" file which controls the available recognition 
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sessions that are allowed on a given system. (This program file is executed as a 
condition of using the Nuance software and is not otherwise required for system 
operation.) 

Next, the "resource-manager.exe" program file is launched. This is a Nuance 
program which is like the gatekeeper for controlling the Nuance sessions. The 
"recognition-server.exe" is then launched and provides the Nuance grammar engine that 
is started with the grammar "package" defined for the application. 

A VOS-Nuance interface "VRServer.exe" program is then launched to provide 
an interface between VOS and Nuance. Finally, the "VOS6W.exe" program file is 
launched. This VOS6W.exe file provides the Parity VOS application engine, and must 
have an identified valid .vx (compiled VOS script) file specified in order to start. 

Conveniently, an executable file (StartVR.exe) is provided to facilitate this 
ordered launch and execution process. This executable performs the above tasks and 
monitors the program modules as they are running. 

One embodiment of a particular example of the general flow of new business 
user (new merchant) 201 interaction with the inventive system and method according 
to one embodiment of the invention is now described with general reference to 
FIGs. 1-4. A business user 201 is a user that is providing goods or services to 
consumers 101 where consumers also refer to the previous caller 101. The business 
user is desirous of having their goods and services made available to consumers over 
the inventive system and in promoting their goods and services to consumers. 

Once the business user 201 calls, and gets identified as a new business user 
utilizing a business user registration procedure the business user is asked to say certain 
business registration information, including for example their name, name of the 
business, phone number, credit card number, and/or other pertinent business 
information. Once the registration information is obtained, the system 102 compares the 
information provided by the new business user 20 1 with the information that resides in 
the database 134. The database includes information regarding business so that the 
authenticity of the attempted registration can be verified with reasonable assurances. 
In the even that the information does not match, the system 102 may connect to other 
databases in an attempt to verify the authenticity or otherwise complete the registration. 
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If the information matches, the user registration is completed. If the information does 
not match, the user is notified with a message providing the new business user with 
additional options or information, or to recommend trying to say the information again. 
For example, in one embodiment of the invention, the new business user is prompted 
with the audio message " Sorry, but the information you have provided does not seem 
to be correct, say 'again* to start over. You can also hang up and call again, say 'help', 
or register at our web site www.Talk41 1 .com". 

Once the new business user 201 is registered and a password is issued to the 
business user, then he or she is requested to record a short message that will be heard 
by the callers 1 0 1 who request the business user's phone number. Voice recognition can 
be used (in conjunction with an optionally previously stored authentic voice print of 
the business user) to authenticate the business user 201 in addition to or instead of the 
password 212 depending on the quality of the speech recognition technologies used and 
the quality of the line or other communication ling connecting the business user to the 
system at the time. So called "caller identification" available in some areas may also 
assist in verifying the identity of the business user where the business user would then 
be required to call from a registered telephone number. 

Once the business user approves the short message just recorded, the recorded 
short message is published then he or she is requested to provide an additional longer 
message that may be or include a special promotion, directions to the business location, 
or any other information that will provide additional information to the callers. Both of 
these short message and long message are available for playback to callers and can also 
be viewed in text form by those who visit the web site and look up that particular 
business. The new business user can change either message completely over the phone, 
or edit it word by word on the PC connected to the web site. Opportunities to subscribe 
as a category sponsor, and to record and publish a category sponsor message, provide 
a service description, and to review or change messages are provided. 

It is noted that the messages provided by the business may either be a 
representation of the business representatives own speech which is preferred so that the 
quality and character of the voice is maintained, or the message may be computer 
synthesized speech. The later being necessary if the business chooses to provide or later 
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modify the message using text input on a computer. As maintaining original speech 
may be somewhat cumbersome, additional fees may be levied on the business for 
providing actual speech as compared to synthetic speech. Alternatively, the business 
user may be able to select from a set of available synthesized voice types so that the 
voice, even though not provided by the business directly, provides the intended feeling 
or emotion associated with the business. For example, a restaurant may wish to convey 
the feeling of romance. 

In yet another alternative, the business user may choose a voice talent to record 
the message, or choose to have synthesized speech simulate a celebrity voice either 
living or deceased. 

Having described the general operation of the system during a new business 
interaction, we now describe one particular exemplary embodiment of the new business 
interaction procedure. As illustrated in FIG. 3, the business user calls the decision is 
made as to whether the calling business user is a new user or in existing user. Once it 
is determined that a new account needs to be established, registration of the new 
business account proceeds as described above wherein the calling business user 
provides certain business registration information register the new account (Step 302). 
The registration information provided by the registering new business user must be 
verified (Step 304) before the new business user interaction can continue. If 
verification cannot be made, then the interaction is terminated (Step 306), otherwise 
the business user is prompted to record a short message (Step 308). The business user 
can then approve the recorded short message 2 14 or change the recorded short message 
until the business user is 'satisfied with the recorded short message an approves it for 
publication (Step 3 10) at which time the short message is published in a voice form and 
in text form on web site 134 (Step 312). 

Business user 201 is then prompted to optional record either no message, and 
long message 218, or sponsor message 219 (Step 314). In the event that the business 
user to record no additional messages, the business user is thanked for providing the 
information (Step 334) and the interaction terminates (Step 336). If the business user 
chooses to record a long message than the business user records the long message (Step 
316) and a given on opportunity to approve the recorded message or change that 
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message until the business user is satisfied with the recorded long message (Step 3 1 8). 
The long message is published (Step 320) and the business user is again the given 
opportunity to learn about category sponsorship (Step 322). If the business user 
declines the opportunity to learn about category sponsorship, the business user is 
thanked for providing the information (Step 334) and the interaction terminates (Step 
336). On the other hand, if the business user indicates a desire to learn about category 
sponsorship he or she is provided with the description of the sponsorship service (Step 
324) and again asked if he or she wishes to subscribe to the category sponsorship 
service (Step 326). A category sponsorship message is a message will come up when 
the caller requests businesses in a category without a specific business name in mind. 
Then the system will play back the message of the sponsors in that category in a 
pre-determined order, random order or a dynamically defined order based on some 
policy or membership level or category, or other rules. If the business user declines the 
opportunity to subscribe, the business user is thanked for providing the information 
(Step 334) and the interaction terminates (Step 336). If the business user indicates a 
desire to subscribe, he or she is given opportunity to record a category sponsorship 
message (Step 328) and further opportunities to either approve or change the message 
until he or she is satisfied with the recorded category sponsor message (Step 330). The 
category sponsor message is then published (Step 332) and the business user is thanked 
for providing the information (Step 334) and the interaction terminates (Step 336). 

As illustrated in the flow chart diagram of FIG. 4, the procedures associated 
with the repeat or registered business user interaction are substantially the same as, 
though not identical to, those just described for a new business user interaction. The 
differences primarily concerned how the initial phase of the business user call to the 
system is handled. For in existing registered business user interaction, the system 
receives the business user call and determines if it is a new user or an existing registered 
user (Step 352). If the system determines that it is a new business user, then the 
procedure already described relative to FIG. 3 is executed. However, if the system 
determines that an existing registered business user is calling into the system, it 
presumed step the existing business user wishes to make changes to one or more of the 
items of registration information or to one or more of the recorded messages (Step 354). 
If the business user decides after placing the call that he or she does not wish to make 
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changes than the interaction terminates (Step 356), otherwise the business user is asked 
whether he or she wishes to change the short message, the long message, or the sponsor 
message (Step 358) and given opportunity to change one or more of these messages. 
These messages are the Long, Short and Sponsor messages that he may have already 
input into the system via a phone or a personal computer of other information appliance. 
The process for recording, changing, approving, and publishing each of these messages 
is similar to that already described relative to FIG. 3 and indicated in the FIG. 4 flow 
chart, and the business user it is similar given additional opportunities to learn about, 
subscribe to, and record messages pertinent to additional services provided by the 
system. 

In order to make the user interface more satisfactory, additional steps can be 
introduced or some of the shown steps can be deleted from the interaction flow. For 
example, after the Business User makes changes to the short message, he can be 
prompted to see whether he wants to make any changes to the long or the sponsor 
messages. An example of the deletion of a step can be where the user is initially 
prompted to find out whether he wants to make changes and gets told that he can say 
anytime "make changes" and trigger the menu options. The amount of consolidation 
largely depends on the speech recognition technologies employed and the key words 
chosen for the speech recognition vocabulary. 

In addition, other' embodiments of the invention may largely or entirely 
eliminate the particular command and data extraction procedure set forth in the above 
described procedures and replace them in all or in part by a natural language recognition 
and extraction procedure that either listens to the user's request in free form speech and 
extracts commands and/or data from the user's speech, or extracts the commands and/or 
data in part and intelligently asks additional questions of the user for any added 
information. In this sense, the inventive system and method provide logic for 
conducting a dialog or conversation with the caller. Essentially the same or 
substantially the same information is exchanged between the user and the system but 
with a more flexible interface that is more familiar and enjoyable to the user. 

Those workers having ordinary skill in the art in light of the description 
provided here will appreciate that the procedures described for existing registered users 




as well as for new business users may be modified to provide somewhat different 
options at each stage of the interaction or to provide different ordering of the options. 
Therefore, the interaction described here are merely exemplary of the type of business 
user to system interaction desirable in an implemented system, but does not limit the 
inventive system or method to these particular interaction schemes or procedures. 

FIG. 5 shows an exemplary embodiment of the General User interaction 402 
such as might occur when a consumer calls for information. Once the consumer user 
1 0 1 calls, a greeting message 382 is played back (Step 404) such as "This is TALK4 1 1 , 
your best source for local information" followed by a sponsor message 384, such as for 
example "brought to you by Dialsurf, bringing the web to your phone" (Step 406). This 
sponsor message is typically a paid message by a sponsor. Then the voice menu 386 is 
played back to the caller (Step 408), such as "Please say your selection: Restaurants, 
Lawyers, Auto dealers, etc.". Once the consumer user says one of the menu items (Step 
410), then he or she is prompted with a request message (Step 412), such as "Say the 
name of the business or say 'select" 1 . 

If the consumer user 101 says "select" or another word identified to the system 
that indicates to the system that he or she (the consumer user) should be prompted with 
a list of pre-selected business names, he or she is prompted with a request to specify 
selection criteria (Step 414). This criteria 388 is pre-determined and varies according 
to the type of business. In the case of restaurants for example, it may be "type of cuisine, 
city and zip code". In case of lawyers, it may be for example "type of practice, city and 
zip code". Once the user says the criteria 388, then the system 102 tries to match the 
requested category or criteria 388 to the closest category or criteria stored in the 
database 134. If the match is good (according to some predefined rules or decision 
algorithm or procedure), then the system will play back a number of business names 
pre-determined by the system (Step 41 8). These names can be picked from the database 
134, in the requested category, in a pre-determined fashion, randomly or based on a 
dynamically changing criteria or some fixed set of rules. 

The inventive system, method, and business model or operating method is 
applicable to a broad variety of business and merchant types including but not limited 
in any way to: restaurants, physicians and surgeons, auto parts, auto repair and service, 
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pizza, auto dealers, department stores, attorneys/lawyers, dentists, hospitals, insurance, 
beauty salons, banks, plumbing contractors, florists, as well as many other types of 
businesses and services. The system may also be adopted to professionals, or other 
individuals or organizations other than those specifically qualifying as businesses or 
merchants. 

One example of pre-determined way is for the subscribed businesses to pay the 
corresponding fees to be included in the top category (Category #1), second category 
(Category #2), and the like for a specific time period. An example of random procedure 
can be, as the name implies, based on a random number generator that picks a database 
record in the category requested. An example of dynamically changing criteria is when 
users rate the businesses on a real time or periodic basis and which ever business is 
rated highest gets to be heard as the #1 (first named), #2 (next named), #3 (third 
named), and the like down a hierarchical or other list. 

After the pre-selections are played back, the caller is invited to say the 'number 
of the menu selection' or to say 'more" (Step 454), if the caller responds with the 
number of the selection, the number and a short message is played back (Step 450) and 
he or she gets prompted with a questions such as "Say connect or 'more* for additional 
information" (Step 452). If the caller says "connect", the caller is connected to the 
phone number that was found (Step 444). If the caller says "more" (Step 436) then the 
pre-recorded Long Message is played back (Step 438) with a question such as "Say 
connect or just hang up your phone" (Step 440). Based on the caller's selection, either 
the caller gets connected to the phone number (Step 444), or gets disconnected to the 
service (Step 446). Of course, different rules may be applied to permit the user to input 
different choices, however, in some situations it is desirable to have a user call in again 
when they have rethought their need rather than to tie up the connection for an extended 
period of time. 

If the caller responds by saying 'more', then additional pre-selections are played 
back to give the caller more and different choices (Step 456). The caller may then 
either say the number of one of the new selections (Step 458) or terminate (Step 446). 
In some instances, the caller may be permitted to keep repeating the request for more 
choices until all choices available in the data base (or a predetermined number of such 
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choices) have been presented to the caller. In either event, if the caller does not like 
or select one of the available choices, the call terminates (Step 446). 

If there is no match (Step 420), an answer such as "sorry, but we cannot find 
this category in our list, try again" is played back (Step 422). After a predetermined 
number of tries (for example, after two tries) if there is no match, the system will say 
something like "sorry, we could not find a business that matches your request, please 
call us again" (Step 424) and terminate the call (Step 426). If there is a close match, 
the system will play back the match to verify the request for further action (Step 428). 

Once the caller chooses the business by saying its name or menu number (Step 
430), the number and a short message is played back (Step 450) and he or she gets 
prompted with a questions such as "Say connect or 'more' for additional information" 
(Step 452). If the caller says "connect", the caller is connected to the phone number that 
was found (Step 444). If the caller says "more" (Step 436) then the pre-recorded Long 
Message is played back (Step 438) with a question such as "Say connect or just hang 
up your phone" (Step 440). Based on the caller's selection, either the caller gets 
connected to the phone number (Step 444), or gets disconnected to the service 
(Step 446). 

If the caller says the name of a particular business instead (Step 448), then the 
phone number and the Short Message (refer to FIG. 3) will be played back (Step 450) 
with an additional prompt (Step 452), such as "Say 'connect' or 'more 1 for additional 
business information (the Long Message per FIG. 3). Once the Long Message is played 
back (Step 438), the user will be prompted once more whether the connect or terminate 
the call (Step 440). 

FIG. 6 shows an exemplary implementation of the Directory Service of the 
information center 136, where the Web Server 471 serves VXML or HTML/XML 
pages 470, the LDAP Server (Lightweight Directory Access Protocol Server) 472 runs 
over TCP/IP and provides quick response to high volume lookup to the Database 473. 
It is noted that this is merely one example of how an information center may be 
organized and that its computer or server based organization may be integrated with 
various types of call centers, service centers, and the like as are known in the telephony 
and server technologies. The Middleware 474 is the layer of software that integrates 
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operations of the Web Server, LDAP Server, and the Database (or any additional 
software such as Transaction Server). LDAP Servers and operation are known in the 
art, and is described, for example as of 23 March 2000, at: 



An example of a Web Server for high volume application such as TALK41 1 is 
the Microsoft Internet Information Server (IIS). Microsoft IIS runs on Windows NT 
Server. LDAP also runs on Windows NT® 4.0 using Service Pack 4 or later, 
Windows® 2000, or Windows 95/98. All systems desirably have TCP/IP (or an 
equivalent capability) installed. Additional information relative to Microsoft products, 
including Microsoft IIS is available on their website as of 23 March 2000 at 
http://msdn.microsoft.com/isapi/msdnlib.idc?theURL=/library/psdk/ldap/ld 7euh.htm 

Other optional but desirable features may also be provided. For example, one 
desirable promotional feature involves issuing an audio coupon (also referred to as 
voice coupon) to a consumer user of the inventive system. In one embodiment, a 
consumer user is issued an audio coupon entitling the user to a promotion. Typically, 
such promotion would entitle the user to a discount to be applied to the item or service 
purchased when the consumer user connects using the inventive system and method. 
This discount, for example 1 0 percent off, would only be available to the consumer user 
when using the inventive TALK411 system and is therefore an enticement for a 
consumer user to use the inventive system rather than dealing with the business through 
conventional means. Other promotions might involve a buy one get one free offer, of 
free drink with order of food type offer, or any other the other variety of promotional 
offers typically made in the retail trade between merchants and consumers. 

The audio coupon may be provided in a variety of ways. For example, the 
business would become aware that the consumer user contacted the business using the 
inventive TALK41 1 system and automatically give the consumer user a discount (or 
other promotional item) when the your was placed. Alternatively, the consumer user 
might be given a coupon code which could only be available to a consumer user who 
ate utilize the inventive system, and a consumer user would provide this code to the 
merchant upon connection. This code might be generic to the business or particularized 
to that specific transaction. Therefore, in addition to be self promotion aspects of the 
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business model, the optional use of audio coupons also provides considerable business 
advantages. In one embodiment of the invention, the system inserts a message to the 
merchant after the call has been connected to identify the caller as a valid service user 
and to validate the audio coupon. 

In a preferred embodiment, the use of audio coupons is integrated with the world 
wide web or Internet in that the audio coupons may be identified, stored to, retrieved 
from, or otherwise processed using the businesses or the inventive services web site. 
In this way, the consumer user is not limited to using the coupon at the time it was 
earned, but may instead be collected for later redemption. This also affords an 
opportunity to obtain a printed copy of the coupon for use at any later time. 

In yet another embodiment, the audio coupon or a coupon derived from that 
coupon may be delivered to a personal data assistant or PDA (such as for example, an 
email enabled PALM VII) so that the PDA stores the coupon and serves as a medium 
for displaying the coupon the business, merchant, or organization. 

Independent of how the coupon is delivered, one aspect of the inventive system, 
method, and business model is to collect money or other revenue in what ever form for 
each coupon delivered. It is also advantageous to collect money or other revenue for 
each coupon redeemed either as a fixed amount per coupon or as a portion (such as a 
percentage) of the sale, or both. Collection of revenue for each coupon delivered is 
separate from collection of revenue for each redemption or sale. 

A Professional Message Conversion Service and Method Embodiment is 
optionally provided and provides a capability for converting recorded or typed 
(character based) promotional messages to professionally recorded message. Once the 
user records the promotional message and saves it, the user is prompted to keep the 
message the way it is recorded, or to get the message re-recorded by a professional 
audio talent. Then the recorded message is sent to the mailbox of the audio talent. He 
or she records the message and saves it. An email is automatically sent to the user 
notifying that the message is recorded and will be activated once he/she reviews it. 

In a further embodiment, the system has geographical context provided by a 
known location of the caller. For example, it is expected that mobile or cellular 
telephones will have capability to self locate, either using internal satellite-based 
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Geographical Positioning System (GPS) means or by using various schemes known in 
the art for determining (or estimating) the location of a cellular telephone based on 
proximity to cellular base stations, hand-off s to base stations, and similar techniques. 
In any event, the inventive system provides for geographically-based recommendations, 
geographically-based promotions, as well as for geographically based audio coupon 
delivery. Here the geographic proximity may be established according to some set of 
rules which may for example depend upon the density of business establishments in the 
local area. However, in one embodiment the geographically directed audio coupons 
pertain to business within one to a few blocks of the callers location, in other 
embodiments to a mile or two, and in still other embodiments to the region of the city 
or town. Caller ID may alternatively be used when an address associated with the 
telephone number is available. 

In yet another optional feature, consumer user's who call into the inventive 
service will be able to rate the particular business after they have utilized the businesses 
goods or services. For example, a consumer user having been referred to a restaurant 
using inventive system can later call in using a toll-free or free local phone number and 
provide feedback, such as in the form of a rating, relative to their experience. These 
ratings would then be compiled and made available to the local businesses. Hopefully 
such feedback would encourage the businesses to either maintain their high quality of 
service or to improve the quality of their service and/or goods in response to the 
consumer user's rating. 

In another embodiment, these ratings were also serve as an additional 
information source for consumer user's and would be available either or telephone or 
on Internet based website. The business establishment having demonstrated a 
particularly high-level of goods or service based on these ratings would be placed into 
a category of highly rated businesses, such as "BayHits", would be available to the 
consumer user during his or her call into the system. So for example, when the user 
calls in to request "Italian restaurant in Palo Alto", if in one of the candidate restaurant 
played back to the caller happens to be a "BayHit" then that restaurant would be 
indemnified as such. For example, the caller might receive a message "II Fornaio - a 
BayHit". Alternatively, consumer user may be able to request "BayHit Restaurants" 
and receive only a list of restaurants satisfying the BayHit criteria. In some embodiment 
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of the invention, the rating or BayHit feature may be provided free to the businesses 
while in an alternative preferred embodiment businesses falling within the highly rated 
or "BayHit" category would be charged the nominal fee. Those workers having 
ordinary skill in the art in light of description provided here will appreciate that this 
rating and promotion scheme may be implement in a variety of ways and that the 
particular descriptions provided here are merely exemplary of the more general method. 
The ratings may alternatively or additionally be provided on an Internet website (such 
as http://www.bavhits.com so that information obtained from caller's using inventive 
system method would be available to other individuals and businesses as well. 

BayHits is the name for a promotional vehicle for local businesses to promote 
their products and services through the telephone and the web to the San Francisco Bay 
Area (or any other) population. Analogous names may be applied to other regional 
markets. Businesses offer a promotional discount of some sort in order to participate 
and consumers have to register and participate in the "ratings" in order to earn points 
redeemable towards subscribed discounts. For example, John Doe registers either at the 
web to be a BayHits Rater. He is immediately given some points (for example 500 
points). He can spend these points in exchange for "deals" from participating 
businesses. Once he calls in and rates one of the participating vendors, he will earn 
points for each rating. 

Ratings will be made available to the participating vendors on the web site. Each 
participating vendor will be able to see the number of redemptions, ratings for their own 
business compared to the rest of their category without specific business names. This 
will help them understand where they stand against their competitors and give them a 
chance to change their promotional strategy. This is a win-win program, since 
participating consumers continue to win points, and hence discounts from participating 
vendors. Participating businesses win because they get increase in patronage and get 
real time feedback through the phone or the web. If businesses get high ratings, they 
will be promoted free of charge on the Talk41 1 phone and web services by the label 
"BayHit". For example, if ABC Pizza is rated high enough to be a BayHit, every time 
someone calls to find out info about ABC Pizza, it will be announced "ABC Pizza, a 
BayHit". Furthermore, when callers call Talk41 1 , they will be able to ask for "BayHits" 
in a specific Category and get connected. In one business model participation in the 





BayHits program will be free until brand is recognized and momentum is built with 
participating vendors. Subscription to the Talk41 1 service is a prerequisite to become 
a participating business in the BayHit program. 

Other optional services and procedures may also be provided. An optional 
Audio Talent brokerage portal provides a list of audio (voice) talents. In a reserved area 
of the web site a list of audio talents are provided to the business users. Once the 
business user clicks on the audio talent, a pre-recorded message is played back. The 
user selects to purchase the services of the audio talent of choice, and provides payment 
means, such as for example credit card or other account information necessary for the 
transaction. 

An Audio Coupon redemption tracking mechanism by auto decrementing a total 
count of purchased coupons. Audio coupons are purchased through the phone service 
or through the web site. Once the coupon message is saved it is ready for delivery to 
users. Once the user gets connected to the business who has purchased the coupons, a 
coupon counter gets decremented by one. The balance of available coupons is updated 
in the business account information provided to the business on the phone or on the web 
site. 

Promotional messages may be displayed in an Internet web-based search box. 
These messages may be rotated based on predetermined criteria, such as x many 
messages in y minutes. When the user hovers or clicks the mouse or other pointing 
device over the message box the message will get fixed. When the user hits return, a 
page from the Talk4 1 1 web site will be served with information about the business with 
the displayed message. If the user starts typing, the box will be cleared and accept the 
keystrokes of the user. 

Facsimile Business Information Input Service Embodiment provides for fax 
input of business information that gets scanned and converted to text and get played by 
TTS over the phone. In order to motivate and provide ease-of-use to the business users, 
a fax back service will be provided, where when the user calls in to the Talk41 1 service 
and wants to be a member, he/she will be faxed a form page with fields of information 
that need to be filled out and faxed back to Talk41 1 . This information will be scanned, 
edited and accepted to the system. Then a fax will be sent back to the new Business 
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User notifying that a membership is formed and that the user should call Talk41 1 to 
check and accept the information. 

Some of the steps in this procedure can be changed, left out, or combined to 
make the user interaction to be a satisfying experience as will readily be understood by 
workers having ordinary skill in the art in light of the description provided here. 

Exemplary Typical Call Flow for a Member Business Using Talk41 1 Service 

Some typical representative call flow scenarios are now described. These are 
merely illustrative as both the content and ordering of the content may change to suit 
local or region speech patterns, the goals of the call service, the demographics of the 
caller to the extent they can be determined, and numerous other objective and subjective 
or cultural factors. 

In this embodiment, a new business manager/owner calls in and registers to be 
a member. She/he is asked to provide a credit card number for business information 
verification and services. Then, she/he gets prompted to verify the existing information 
and subscribe 3 months free of charge to the basic services where she is told she can 
record her promotional or informational message at the prompt. Then she/he is given 
a chance to review the existing business information in the Talk411 database, edit 
information if not correct, review and publish. Then she is given a chance to review and 
publish her promotional or informational message and asked if she wants to learn about 
enhanced membership services. If she/he says yes, then she is briefly explained what 
those are and given a chance to subscribe. She/he is told she can always change her 
message, subscribe for additional services either on the telephone or on the 
Talk4 1 1 .com web site. 

First Exemplary Script for Consumer Call: 

"Hello, welcome to Talk41 1, Bay Area's fresh information source" 
"Please speak clearly. You can cut me off at any time, I won't get offended" 
"If you are a member or want to become one, say 'Yes', if you want me to Find a 
business for you, say the name, or the type of the business" 
ABC Pizza 

"ABC Pizza is at 408-255-0000, say "connect", or say "more" for more info" 
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More 

"There is no additional info. Do you want me to connect now or would you like me to 
recommend others in the area?" 
Others 

"Choice l:Domino. Choice 2: Straw Hat with 10% off on any large Pizza, say your 
choice" 

If answer is: Straw Hat 
"You will need the following number to get your discount. Write it down while 
I connect. It is: SHI. Again, it is SHI. Thank you for calling us" 

If answer is: Domino 's 
"All of Domino's Pizzas are made with real cheese and guaranteed to be hot on 
delivery! Thank you for calling us." 



Exemplary Script for Business Call: 

"Hello, welcome to Talk41 1, fresh info source for the Bay Area" 
"Please speak clearly." 

"If you are a member or want to become one say 'Yes', if you want me to find a 
business for you, say the name or the type of business" 

/ want to become a Member 
"What is your business name or business phone number?" 

ABC Pizza 

"I do have some info for your business. Please answer my questions so I can process 
your membership. Your first three months are free of charge. We do require a major 
credit card so we can continue to provide you with uninterrupted service and give you 
the ability to order audio promotions and coupons any time any where on the phone or 
at the Talk41 1 web site. We will not charge your card until your 3-month trial period 
is complete. Please provide a MasterCard or a VISA credit card number followed with 
the expiration date" 

xxxx xxxx xxxx xxxx and the expiration is 03/01 
"What is the full name on the card?" 

John Doe 
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"What is the zip code for the billing address?" 
94102 

"Please wait a moment while we process your information" 
(may play an ad here) 

"Thank you. Your password is PassOO 1 . Please write it down. Again it is PassOOl . Now 
you can dictate a short message that I will deliver to callers when they request your 
business information. You can change this message anytime here or at Talk4 1 1 web site 
by signing in with your password. Please dictate at the prompt or say Exit" 

ABC Pizza is located right next to the Great Mall and we serve anywhere in the 

Bay Area 

"This is how I understood your dictation: "ABC Pizza is located right next to the Great 
Mall and we serve anywhere in the Bay Area If this is not correct please repeat the 
message. If it is correct, say so" (after two tries, go to the Talk41 1 web site to type in 
your message). 
Correct 

"Congratulations. Now all callers who request your business info will hear your 
message. I can promote your business in other ways as well such as Category 
Sponsorships and Audio Coupons. Please visit our web site at www.Talk4 1 1 .com and 
give me a call again. When you are ready to purchase additional services you can say 
"Promote" any time and I will step you through it. Thank you for your business" 

Exemplary Script for Experienced Consumer Call: 

"Hello, wel... 

Pizza Coupons 
"What is your zip code?" 

94122 

"Say your choice any time to redeem your coupon and connect: 
Domino's two for one for any large pizza 
Straw Hat 10% all Medium size pizzas 
Big John's 50% off the second pizza. . . 
Big John 's 
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"You will need the following number to get your discount. Write it down while I 
connect. It is: BJ1. Again, it is BJ1. Thank you for calling us" 

Exemplary Script for Experienced Business Call: 

"Hello, wel... 

Member ABC Pizza 
"Please say your password" 



"Do you want to change your message, or purchase new promotional services?" 
Promo 

"Extended Message, Category Sponsorship, General Sponsorship or Audio Coupon?" 
Extended Message 

"This feature is an additional $20 per month. You will be able to provide the 
information most callers are requesting for your type of business by answering the 
questions I will ask, or you can leave a general message X seconds long. Say 
"Template" or "General message" to pick one" 

Template 
"Do you deliver?" 



"Do you charge for delivery?" 
No 

"Which zip codes or cities do you deliver to?" 
San Jose, Milpitas 

"What is your house specialty pizza and how much does an 18" one cost?" 

ABC Supreme and it costs $15 
"Do you offer per slice and if you do how much for a pepperoni/cheese slice? 

Yes, it is $2.50 per slice 
"What are the closest freeway and the name of the exit?" 

101, exit is 1 st Street 
"Do you take credit cards, if so which ones?" 

Yesrall major credit cards 
"Do you promote your specials on Talk41 1 service and the web site?" 



PassOOl 



Yes 



A-69342, 




-72- 



Yes 

"What the most ordered Pizza?" 

Supreme (if not understood invite to input this field at the web site, or try again 

later) 

'Thank you. All of this information will be displayed on the Talk41 1 web site and will 
be available to callers who are interested in listening." (Note: A feedback mechanism 
to verify the inputs are correctly recognized may also advantageously be provided.) 

"Do you want to purchase Audio Coupons?" 
Yes 

"Audio Coupons are sold in quantities of 100, 500, 1000, 5000, 10000. Price for 100 
is $1 each, 500 is 90cents, 1000 is 80cents, 5000 is 75cents and 10000 is 70cents per 
coupon. Please pick a quantity" 
One thousand 

"Please dictate the discount message you want to communicate" 

Purchase one large pizza and get a medium pizza free 
"Thank you. Callers who request to hear coupons or info about your business will hear 
your message. If you would like to be the first or second heard when callers ask for your 
category, please say "promo category sponsorship, otherwise say exit or hang up" 

Promo category sponsorship 
"Do you want to check availability by date or by preference order?" 

Preference order 

"#1 is available April 8 at $5000, #2 is available May 15 at $2500, and #3 is available 
now at $1000. All prices are valid for 3 months, and can be purchased for one, two or 
three months. Please say the pecking order you want to purchase followed by the 
number of months." 

#1 for 3 months 

"Your total will be $15,000. Would you like us to charge this to your credit card on 
record or would you like to send a check?" 
Send a check 

"You have till xx/00 to send your check. If not received by this date, your promotion 
will be cancelled, and a deposit may be required in the future. Please send your check 
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for $15,000 to Talk411 Service, P.O. Box 10000, San Jose, CA 95131, again 
(repeat). . . . Thank you for your business. You can check the status of your account and 
order additional services through the Talk411.com web site. Thank you for your 
business." 

It is noted that for experience callers, the scripted message may be interrupted 
by the caller so that his or her response may be acted upon more quickly. It is not 
necessary for the caller to listen to the entire prompt before responding. This feature 
reduces or eliminates potential irritation at having to wait to respond. 

Exemplary Script and Call Flow for Caller Interaction: 

Caller interaction with the inventive system is important to the commercial 
success of the system, method, and business. In general, consumers do not tolerate an 
interaction that is overly rigid or inflexible, or that requires carefully formatted 
responses. FIG. 7 is a flow-chart diagram setting out the caller flow and interaction 
with the system for a method of interaction between the inventive system and a caller. 
The flow-chart of FIG. 7 is broken into FIG. 7 A - 7E for convenience in presentation. 
The flow-chart includes call flow procedures as well as procedures for articulating 
messages to the caller and for receiving inputs from the caller and may serve as a basis 
for developing the call handling procedure and the talk41 l.vs script file. 

With reference to FIG. 8 (FIG. 8A-8B), an exemplary embodiment of a second 
different call handling procedure 700 is provided. This is another example of a call 
handling procedure which, like the others provided here, may form the basis for a VOS, 
voiceXML, or C++ based script implementation. It starts 702 with receipt of a 
telephone call 703. The call may be directed in one of two ways, to member services 
704 or to information services 750. For the member services path 704, a determination 
is made as to whether it is an existing member or a new member 705. If it is not new, 
then the member may be either a member 706 or a business member 718. If the 
member is a business member 718, then the members ID is obtained and verified 719. 
Next a check is made as whether they wish to check a rating or request information on, 
make or modify a promotion 720. If they choose to check their rating, they are given 
the requested rating information 721 and then asked if they wish to check their rating 
or promotion again 720. This allows them to sequentially address both rating and 
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promotional issues without requiring a call back. The member is then permitted to 
select between sponsor and coupon paths 724. Should the business member choose 
coupons, they are provided with information concerning coupon options and/or coupon 
pricing 725 and then given an opportunity to buy coupons or obtain additional 
information 727. If they instead select sponsor, they are provided with information 
concerning sponsorship category options 726 and then permitted to buy a sponsorship 
or obtain additional information 727. Should the business member then chose to 
receive neither coupon nor sponsor information, the call terminates or ends 749. 

In the caller chooses to receive additional information, they receive a benefits 
pitch 728 and are then given an opportunity to subscribe or verify 729, and if the caller 
business member chooses subscribe, a request is made for credit card information (or 
other account, billing, or payment information) 743. Credit card (or other account, 
billing, or payment information) is also collected if the earlier caller decision was to 
buy coupons or a category sponsorship 727. If the caller chooses to verify 729 and the 
information provided cannot be verified 730, then the caller is connected to customer 
service 731. Customer service may be an automated system, a live operator, or a 
combination of the two. If the verification can be made 730, then the caller is asked 
to select 744 if they wish to subscribe in which case the credit card information is 
obtained 743, end the call 745, or obtain more information (including at this point 
information that may only be available from a customer service representative 73 1 , but 
also from a benefits presentation or pitch 728. After credit card information has been 
obtained (and desirably verified) 743, the caller is asked whether they wish to record 
the promotional message or coupon over the phone at that time or using one of the 
available internet web based recording interfaces 747. Recall that the web based 
interfaces permit voice recording, typing the promotional item for automated text to 
speech conversion, utilizing a voice talent, using a synthesized celebrity or historical 
characters voice, or other option. If the caller chooses to record now, the caller is given 
an opportunity to record and verify the recording including opportunity to re-record 
until satisfied 748. After selecting the recording option, the caller is again asked to 
choose other options relating to sponsorship and coupons 724 as already described. It 
is noted that other options and call flow may be added or modifications made and that 
the particular choices available at any point may readily be altered. 



A-69342, 



-75- 

It is further noted with reference to the flow chart in FIG. 8, that after the caller 
selected member services 704 and identified themself as a new member 705, somewhat 
different information is required before being asked if they wish to subscribe or verify 
729 or be connected to customer service 73 1 , where the afore described procedures are 
repeated for the new business. In particular, the system asks if they are a business 737, 
and if they identify themself as a business provide their business name and telephone 
number 740. It the matching business is found, they are asked to subscribe or verify 
729 and which point they make a selection and proceed as already described for existing 
members. 

In on the other hand they are not a business 737, they are requested to provide 
their telephone number and password 738, and to chose between setup over the 
telephone and web based set up 739. If they choose web-based set up, set up will occur 
over the web at a later time, and the call ends. If they do not select web set up, the 
caller is given an opportunity to learn about specials and avail themself of specials as 
desired 713. The caller may then choose 714 to connect to a business or modify from 
a list 715. This opportunity is repeated so that the caller may select different available 
options. Once the caller chooses to connect to a business, the caller is connected 716, 
and the call to the service ends 717. 

Referring back to the point in the procedure when the caller selected member 
services 704, and indicated that they were not a business member 718, but rather an 
other member 706, the members identity is obtained and verified 707. That caller 
member is then asked to choose between ratings, points check, or My 708. If the caller 
chooses My, the caller presented with specials 7 1 3 as previously described. If the caller 
selects ratings check the caller is advised of the business and rating 710, and if the caller 
selects check points, is advised of the current points earned 709. The rate/check/my 
loop iterates so that the caller can cycle through and obtain more than one type of 
information while connected. 

Again referring to FIG. 8, the incoming call 703 may be a request to obtain 
information services from a non-member (or even from a member) caller 750. Usually 
the request will be for one of four possible information types, though other types may 
clearly be added, and some existing ones deleted. A request may be for a yellow pages 
(YP).type request where the caller desires to find matches in a category 752. When 
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such request is made, a predetermined number of matches are provided, in this case 
three are provided at a time, with a total of up to nine if additional matches are 
requested 756. These numbers may readily be modified according to response and 
need. A different request may be to find coupons 753. Agin three are initially provided 
in the requested category, with up to fifteen being provided as requested 757. The 
request may also be a request to find BayHits 754, and in similar manner three matches 
are initially provided with up to nine total matches as requested 758. After any of these 
three options are exercised 752, 753, 754, the caller is given the opportunity to select 
one of the matches or request more 759. 

The caller may alternatively make more specific 411-type (but with added 
features) directory assistance type request for the name of a business 75 1 , optionally 
specifying the name and coupon number 755. In this situation as well as when one of 
the other options were selected 759, the caller is given an opportunity 760 to connect 
to the business 763 or to obtain more information about the business 761 before 
connecting. If the caller chooses to connect 763, the call is announced along with any 
promotion or coupon promise 764, at which time the call to the service ends 765. If the 
caller requested more information 761, such information is provided, and the caller is 
asked to choose between connecting to the business 763, ending the call 765, or 
returning to information services 750. 

Internet Web-Based System And Method 

The inventive system and method are also implemented from an Internet or 
other networked computing environment using conventional keyboard, mouse, and 
display interfaces or with the addition of voice input and speech-recognition capability. 
Therefore it should be understood that the system and method described relative to the 
voice recognition embodiment may be implemented from a personal computer or other 
information appliance that supports voice input, such as using a microphone and 
modem connection to connect over an analog or digital telephone or other radio- 
frequency, optical, cable, or wireless communication link. Alternatively, though with 
some sacrifice in ease of use, the voice recognition component may be substituted by 
a typed or character string based interface. In this light, rather than repeating the 
various embodiments of the invention for an Internet Web based implementation, this 



A-69342/RM 



-77- 



section shows various web screen shots and the manner in which web pages are 
organized and navigated to provide an easy efficient and enjoyable experience. It is 
further noted that certain options are provided in the voice-recognition implementation 
to conduct some of the more detailed and time consuming steps at a later time using the 
talk411 inventive system. 

The Talk411 Internet application is developed using Microsoft's IIS (Web 
Server) and Active Server Page (ASP) technology. The Active Server Pages access the 
same database as the Voice system and deliver similar information. The Internet 
web-based Talk41 1 application includes both HTML and ASP files, both of which file 
types are well known in the art. Basically ASP files are HTMP files that include some 
active content. Exemplary html and asp code is provided for the getbusiness.asp file, 
a portion of which pertaining to "Dim Business" is duplicated in Table X. Angle 
brackets identify key words, and anything that starts with a bracket and a % sign is 
active server page code. The indicator "html" is a label for the html page. The portion 
of the code identified by "DIM BUSINESS". 

For the Internet or world wide web based application All active server page files 
(.asp file) are html pages with a little code that can create some html on the fly. In this 
case, since we are reading from a database and want to display the name, pre-written 
html does not have database interaction, nor does it know what database item it will 
need to provide or display. Therefore, there is a need to use active server pages (.asp) 
where you can put in a little code that instruct to go to data base to get name and display 
the information that was retrieved. An example of a portion of active server page code 
is provided in Table XL 
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Table X. ASP and HTML Files for Exemplary Internet Web Site 


Default.htm 


Main menu page (601) 


Business.asp 


Gives user a list of valid businesses to select from (602) 


Types, asp 


Gives user a list of valid business types to select from (603) 


BayHits.asp 


Displays business that meet the BayHits criteria (604) 


Coupons, asp 


Gives user a list of businesses with coupons to select from (605) 


Members, asp 


Member login form (606) 


GetBusiness.asp 


Displays information for a given business (607) 


GetTypes.asp 


Gives user a list of valid business for a specified type (608) 


MemberMenu.asp 


Validates member ID and password, displays menu (609) 


UpdateBusiness . asp 


Validates business information from MemberMenu.asp for 




update (610) 


RateBusiness . asp 


Validates business rating and updates database (611) 


Rate.asp 


Allows user to select a rating for a business (612) 


InvalidRater.htm 


Displays if rater number is incorrect (613) 


InvalidMember.htm 


Displays if member login is incorrect (614) 


Partners.htm 


Displays Talk41 1 partner list (615) 



Table XL Exemplary portion of active server page and html page code for 
talk41 1 Internet application. 

<% 

Dim Business 

Business = Request ( "Business " ) 

%> 

<form name=TypeForm method=post action= "GetTypes . asp" > 

<% Set conn = CreateObject { "ADODB . Connection" ) %> 

<% conn. Open "Driver=Microsof t Access Driver 

(*.mdb) ; DBQ=c : \inetpub\clients\talk411\directory2 .Mdb" %> 

<% Set rs = conn. Execute ( "SELECT * FROM [Customers] WHERE [CompanyName] = * " & 
Business & " ' " ) %> 

<hl><%= r s (" CompanyName " ) %></Hl> 

<h2>Phone: (<%= Mid (rs ( "PhoneNumber " ) , 1 , 3 ) %>) <%= Mid (rs ( " PhoneNumber " ) , 4 , 3 ) 
%>-<%= Mid (rs ( "PhoneNumber") , 7, 4) %><BR><BR> 
<%= rs ( "Address") %><BR> 
<%= rs("City") %></H2> 

<h3>Specialty : <%= rs ( "Specialty" ) %></H3> 
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Those workers in the art will appreciate the flexibility available in web page 
design and construction using html alone or in conjunction with asp, therefore only a 
few examples of the above listed web pages are illustrated to provide examples of a 
relatively simple graphical user interface. The use of various buttons, hot spots, pull- 
5 down menus, imbedded GIF files and the like are well known in the art and not 

described in any detail here. 

FIG. 9 is an example of the Coupons. asp and gives user a list of businesses with 
coupons to select from (605). This particular embodiment of the coupons web page 
interface provides Find A Business, Search By Type, BayHits! , Coupons, and Members 
10 button or menu bar. The coupons web page also provides a scrollable list for 

Businesses with Coupons. 
"5 FIG. 1 0 is an example of a GetBusiness.asp page (607) and provides information 

ffl for a particular business identified by the user. The web page presents similar buttons 

jH to those on the Coupon web page so that the user can readily navigate between the 

: !J 15 pages using a common interface. The get a business page also provides a menu button 

W for selecting another business in the same category. In this case, other auto repair and 

□ service. BayHits and a rate me menu button are also provided. 

pj FIG. 1 1 provides an example of a member menu web page generated by a 

MemberMenu.asp file that validates member ID and password, displays menu (609). 

O 

y* 20 It provides a box for entry of a member ID number and another box for entry of the 

member Password. The user enters this information and clicks a submit button to 
submit the information and begin processing. Security measures as are known in the 
art may optionally be implemented. 

FIG. 12 provides an example of a member menu for purchasing and editing a 
25 coupon message. Other embodiments of the web page for coupon generation provide 

templates for entering and arranging the information so that the member may try 
different alternatives for web based coupons that may be printed. It may also provide 
an interface for picking voices, sampling voice talent, selecting celebrity voices, and 
selecting simulated voices for text-to-speech conversion. These may be tested at the 
30 web site and made available for the telephone handset based implementation. 

FIG. 13 is a diagrammatic illustration showing the relationships between the 
various web pages and web page asp files described in Table XI. Various other paths 
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between web pages are possible and may be used. Alternative web pages may be 
designed, and features from multiple web pages may be combined into a single or into 
fewer web pages. 



Additional Description 

The foregoing descriptions of specific embodiments of the present invention 
have been presented for purposes of illustration and description. They are not intended 
to be exhaustive or to limit the invention to the precise forms disclosed, and obviously 
many modifications and variations are possible in light of the above teaching. The 
embodiments were chosen and described in order to best explain the principles of the 
invention and its practical application, to thereby enable others skilled in the art to best 
utilize the invention and various embodiments with various modifications as are suited 
to the particular use contemplated. It is intended that the scope of the invention be 
defined by the claims appended hereto and their equivalents. All publications and 
patent applications cited in this specification are herein incorporated by reference as if 
each individual publication or patent application were specifically and individually 
indicated to be incorporated by reference. 



